APPROXIMATION THEORY OF MATRIX RANK MINIMIZATION
AND ITS APPLICATION TO QUADRATIC EQUATIONS

Abstract. Matrix rank minimization problems have recently attracted considerable attention in both
mathematical and engineering fields. This class of problems, which arises in a variety of
cross-disciplinary applications, is known to be NP-hard in general. In this paper, we aim to provide
an approximation theory for the rank minimization problem, and prove that a rank minimization
problem can be approximated to any level of accuracy via continuous optimization (especially, linear
and nonlinear semidefinite programming) problems. One of the main results in this paper shows that
if the feasible set of the problem has a minimum rank element with the least F-norm (i.e., Frobenius
norm), then the solution of the approximation problem converges to the minimum rank solution of the
original problem as the approximation parameter tends to zero. The tractability under certain
conditions and convex relaxation of the approximation problem are also discussed. The methodology
and results in this paper provide a new theoretical basis for the development of efficient
computational methods for solving rank minimization problems. An immediate application of this
theory to the system of quadratic equations is presented in this paper. It turns out that the
condition under which such a system has no nonzero solution can be characterized by a rank
minimization problem, and thus the proposed approximation theory can be used to establish some
sufficient conditions for the system to possess only the zero solution.

1. Introduction. Throughout the paper, let $R^n$ be the n-dimensional Euclidean
space, $R^{m\times n}$ be the space of $m \times n$ real matrices, and $S^n$ be the set of $n \times n$ real symmetric
matrices. For $X, Y \in R^{m\times n}$, we use $\langle X, Y\rangle = \operatorname{tr}(X^T Y)$ to denote the inner product
of X and Y. $\|X\|$ and $\|X\|_F$ denote the spectral norm and Frobenius norm of X,
respectively, and $\|X\|_*$ stands for the nuclear norm of X (which is the sum of the singular
values of X). $A \succeq 0$ ($A \succ 0$) means that $A \in S^n$ is positive semidefinite (positive
definite). Given an $X \in R^{m\times n}$ with rank r, we use $\sigma(X)$ to denote the vector
$(\sigma_1(X), \dots, \sigma_r(X))$, where $\sigma_1(X) \ge \cdots \ge \sigma_r(X) > 0$ are the nonzero singular values of X.

Let $C \subseteq R^{m\times n}$ be a closed set. Consider the rank minimization problem:

(1.1)  Minimize $\{\operatorname{rank}(X) : X \in C\}$,

which has found many applications in system control [HI |H [SSI [57J [201 E3 ES] > matrix 
completion [0J [7J [37], machine learning [TJ [25], image reconstruction and distance 
geometry [23l l35l l33l l30l Hi] , combinatorial and quadratic optimization [2 [38], to 
name but a few. The recent work on compressive sensing (see e.g. [HI El [13] ) also 
stimulates an extensive investigation of this class of problems. In many applications,
C is defined by a linear map $\mathcal{A} : R^{m\times n} \to R^p$. Two typical situations are

(1.2)  $C = \{X \in R^{m\times n} : \mathcal{A}(X) = b\}$,

(1.3)  $C = \{X \in S^n : \mathcal{A}(X) = b, \ X \succeq 0\}$.

Unless C has a very special structure, the problem (1.1) is difficult to solve due to
the discontinuity and nonconvexity of rank(X). It is NP-hard since it includes
cardinality minimization as a special case [30]. The existing algorithms for (1.1)
are largely heuristic-based, such as the alternating projection [T51 [TT], alternating
LMIs [32, and nuclear norm minimization (see e.g. [T51 HH [3UJ [3H |3T]). The idea
of the nuclear norm heuristic is to replace the objective of (1.1) by the nuclear norm
$\|X\|_*$, and to solve the following convex optimization problem:

(1.4)  Minimize $\{\|X\|_* : X \in C\}$.

Under some conditions, the solution to the nuclear norm heuristic coincides with
the minimum rank solution (see e.g. [151 130| I31j). This has inspired an extensive and
fruitful study of various algorithms for solving the nuclear norm minimization problem
[H [3UJ [531 QB 131 EH 13 . However, as pointed out in [301 [31], the nuclear norm
heuristic exhibits a phase transition: for sufficiently small values of the rank the
heuristic always succeeds, but in the complement of that region it may fail or never
succeed. While the nuclear norm is the convex envelope of rank(X) on the unit
ball $\{X : \|X\| \le 1\}$ (see [TB, 30 ), it may deviate drastically from the rank of X
since rank(X) is a discontinuous nonconvex function, and hence it is not a high-quality
approximation of rank(X). As a result, the true relationship between (1.1) and (1.4)
is not known in many situations unless some strong assumptions such as "restricted
isometry" hold [301 .

In this paper, we develop a new approximation theory for rank minimization problems.
We first provide a continuous approximation for rank(X), by which rank(X)
can be approximated to any prescribed accuracy, and can even be computed exactly
by a suitable choice of the approximation parameter. Based on this fact, we prove
that (1.1) can be approximated to any level of accuracy by a continuous optimization
problem, typically a structured linear/nonlinear semidefinite programming (SDP)
problem. One of our main results shows that when the feasible set is of the form
(1.3), and if it contains a minimum rank element with the least F-norm (i.e., Frobenius
norm), then the rank minimization problem can be approximated to any level
of accuracy via an SDP problem, which is computationally tractable. A key feature
of the proposed approximation approach is that the inter-relationship between (1.1)
and its approximation counterpart can be clearly displayed in many situations. The
approximation theory presented in this paper, aided by modern convex optimization
techniques, provides a theoretical basis for (and can directly lead to) both new
heuristic and exact algorithms for tackling rank minimization problems.

To demonstrate an application of the approximation theory, let us consider the
system

(1.5)  $x^T A_i x = 0, \quad i = 1, \dots, m, \quad x \in R^n$,

where $A_i \in S^n$, $i = 1, \dots, m$. A fundamental question associated with (1.5) is: when is
'x = 0' the only solution to (1.5)? The study of this question (e.g. [TTl [T2l l5[ I36 [ [22] )
dates back to the late 1930s. For m = 2 and $n \ge 3$, the answer to the
question is well-known: x = 0 is the only solution to $x^T A_1 x = 0$, $x^T A_2 x = 0$ if and only
if $\mu_1 A_1 + \mu_2 A_2 \succ 0$ for some $\mu_1, \mu_2 \in R$. However, this result is not valid for n = 2,
or for $m \ge 3$. In fact, the condition

(1.6)  $\sum_{i=1}^m \mu_i A_i \succ 0$ for some $\mu_1, \dots, \mu_m \in R$

implies that x = 0 is the only solution to (1.5), but the converse is not true in general.
When n = 2 and/or $m \ge 3$, the sufficient condition (1.6) may be too strong. Thus
finding a mild sufficient condition for the system (1.5) to have only the zero solution is
posed as an open problem in [3T]. We first show that the study of this problem can
be transformed equivalently into a rank minimization problem, based on which we use
the proposed approximation theory, together with the SDP relaxation and duality
theory, to establish some general sufficient conditions for the system to have only the zero
solution.

This paper is organized as follows. In section 2, an approximation function of
rank(X) (and thus an approximation model for the rank minimization problem) is
introduced, and some intrinsic properties of this function are shown. In section 3,
reformulations and modifications of the approximation counterpart of the rank
minimization problem are discussed, and their proximity to the original problem is also
proved. The application of the approximation theory to the system of quadratic
equations is demonstrated in section 4. Conclusions are given in the last section.

2. Generic approximation of rank minimization. The objective of this section
is to provide an approximation theory that can be applied to general rank
minimization problems, without involving a specific structure of the feasible set, which is
only assumed to be a closed set (and bounded when necessary, but not necessarily
convex). In order to get an efficient approximation of the problem (1.1), it is natural
to start with a sensible approximation of rank(X). Let us consider the function
$\phi_\varepsilon : R^{m\times n} \to R$ defined by

(2.1)  $\phi_\varepsilon(X) = \operatorname{tr}\left( X (X^T X + \varepsilon I)^{-1} X^T \right), \quad \varepsilon > 0.$

The first result below claims that the rank of a matrix can be approximated (in terms
of $\phi_\varepsilon$) to any prescribed accuracy, as long as the parameter $\varepsilon$ is suitably chosen.

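Theorem 2.1. Let $X \in R^{m\times n}$ be a matrix with rank(X) = r, and let $\phi_\varepsilon$ be defined
by (2.1). Then for every $\varepsilon > 0$,

(2.2)  $\phi_\varepsilon(X) = \sum_{i=1}^{r} \frac{(\sigma_i(X))^2}{(\sigma_i(X))^2 + \varepsilon},$

where $\sigma_i(X)$'s are the singular values of X, and the following relation holds:

(2.3)  $0 \le \operatorname{rank}(X) - \phi_\varepsilon(X) = \sum_{i=1}^{r} \frac{\varepsilon}{(\sigma_i(X))^2 + \varepsilon} \le \varepsilon \sum_{i=1}^{r} \frac{1}{(\sigma_i(X))^2}$ for all $\varepsilon > 0$.

Proof. Let $X = U \Sigma V^T$ be the full singular value decomposition, where U, V
are orthogonal matrices with dimensions m and n, respectively, and the matrix

$\Sigma = \begin{pmatrix} \operatorname{diag}(\sigma(X)) & 0_{r \times (n-r)} \\ 0_{(m-r) \times r} & 0_{(m-r) \times (n-r)} \end{pmatrix},$

where $0_{p \times q}$ denotes the $p \times q$ zero matrix. Let $\sigma^2(X)$
denote the vector $((\sigma_1(X))^2, \dots, (\sigma_r(X))^2)$. Note that

$X^T X + \varepsilon I = V (\Sigma^T \Sigma) V^T + \varepsilon I = V \begin{pmatrix} \operatorname{diag}(\sigma^2(X)) + \varepsilon I_r & 0 \\ 0 & \varepsilon I_{n-r} \end{pmatrix} V^T,$

where I is partitioned into two smaller identity matrices $I_r$ and $I_{n-r}$. Thus, we have

$\phi_\varepsilon(X) = \operatorname{tr}\left( X (X^T X + \varepsilon I)^{-1} X^T \right)$
$= \operatorname{tr}\left( \Sigma \begin{pmatrix} \operatorname{diag}(\sigma^2(X)) + \varepsilon I_r & 0 \\ 0 & \varepsilon I_{n-r} \end{pmatrix}^{-1} \Sigma^T \right)$
$= \operatorname{tr}\left( \begin{pmatrix} \operatorname{diag}(\sigma^2(X)) + \varepsilon I_r & 0 \\ 0 & \varepsilon I_{n-r} \end{pmatrix}^{-1} \begin{pmatrix} \operatorname{diag}(\sigma^2(X)) & 0 \\ 0 & 0 \end{pmatrix} \right)$
$= \sum_{i=1}^{r} \frac{(\sigma_i(X))^2}{(\sigma_i(X))^2 + \varepsilon}.$

Clearly, $\phi_\varepsilon(X) \le r = \operatorname{rank}(X)$ for all $\varepsilon > 0$. Note that

$\operatorname{rank}(X) - \phi_\varepsilon(X) = \sum_{i=1}^{r} \left( 1 - \frac{(\sigma_i(X))^2}{(\sigma_i(X))^2 + \varepsilon} \right) = \sum_{i=1}^{r} \frac{\varepsilon}{(\sigma_i(X))^2 + \varepsilon} \le \varepsilon \sum_{i=1}^{r} \frac{1}{(\sigma_i(X))^2}.$

Thus the inequality (2.3) holds. □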
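
As a quick numerical sanity check of Theorem 2.1 (an illustration only, not part of the original analysis), the following Python sketch evaluates $\phi_\varepsilon$ in two ways for a small matrix with two columns: from the singular values via (2.2), and from the trace definition (2.1). The helper names are hypothetical. To avoid a linear algebra library, the trace is computed through the elementary identity $\operatorname{tr}(X(X^TX+\varepsilon I)^{-1}X^T) = n - \varepsilon \operatorname{tr}((X^TX+\varepsilon I)^{-1})$ with n = 2, which is our own shortcut.

```python
def phi_from_singular_values(sigmas, eps):
    """Formula (2.2): sum of s^2 / (s^2 + eps) over the singular values."""
    return sum(s * s / (s * s + eps) for s in sigmas)


def phi_from_trace_two_columns(X, eps):
    """Definition (2.1) for an m x 2 matrix X (given as a list of rows), using
    tr(X G^{-1} X^T) = 2 - eps * tr(G^{-1}) with G = X^T X + eps * I."""
    a = sum(row[0] * row[0] for row in X) + eps   # G[0][0]
    b = sum(row[0] * row[1] for row in X)         # G[0][1] == G[1][0]
    d = sum(row[1] * row[1] for row in X) + eps   # G[1][1]
    trace_of_inverse = (a + d) / (a * d - b * b)  # trace of the 2 x 2 inverse
    return 2.0 - eps * trace_of_inverse


# Rank-one example: X = [[1, 1], [1, 1]] has singular values (2, 0).
X = [[1.0, 1.0], [1.0, 1.0]]
sigmas = [2.0]  # nonzero singular values only

checks = []
for eps in (1.0, 0.1, 0.001):
    by_formula = phi_from_singular_values(sigmas, eps)
    by_trace = phi_from_trace_two_columns(X, eps)
    gap = len(sigmas) - by_formula                     # rank(X) - phi_eps(X)
    bound = eps * sum(1.0 / (s * s) for s in sigmas)   # right-hand side of (2.3)
    checks.append((eps, by_formula, by_trace, gap, bound))
    print(eps, by_formula, by_trace, gap, bound)
```

In this example both evaluations agree, and the gap rank(X) - $\phi_\varepsilon(X)$ stays between 0 and the bound in (2.3).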

From the above result we have $\phi_\varepsilon(X) \le \operatorname{rank}(X)$ and $\lim_{\varepsilon \to 0} \phi_\varepsilon(X) = \operatorname{rank}(X)$.
So, we immediately have the following corollary.

Corollary 2.2. For every matrix $X \in R^{m\times n}$, there exists a corresponding number
$\varepsilon^* > 0$ such that $\operatorname{rank}(X) = \lceil \phi_\varepsilon(X) \rceil$ for all $\varepsilon \in (0, \varepsilon^*]$.

This suggests the following scheme, which requires only a finite number of
iterations to find the exact rank of X: Step 1. Choose a small number $\varepsilon > 0$; Step 2.
Evaluate $\phi_\varepsilon(X)$ at X; Step 3. Round up the value of $\phi_\varepsilon(X)$ to the nearest integer;
Step 4. Set $\varepsilon \leftarrow \beta\varepsilon$, where $\beta \in (0, 1)$ is a given constant, and repeat Steps 2-4
above.
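
The four steps above can be sketched in Python as follows. To keep the sketch self-contained, $\phi_\varepsilon$ is evaluated from a list of singular values through (2.2) rather than from the matrix itself. The function names, the starting value $\varepsilon = 1$, the factor $\beta = 0.1$, and the fixed iteration count are illustrative assumptions; the scheme as stated has no stopping rule, since $\varepsilon^*$ is not known in advance.

```python
import math


def phi(sigmas, eps):
    """phi_eps evaluated through formula (2.2)."""
    return sum(s * s / (s * s + eps) for s in sigmas)


def rank_estimates(sigmas, eps=1.0, beta=0.1, iterations=8):
    """Steps 1-4: record ceil(phi_eps) while shrinking eps by the factor beta."""
    estimates = []
    for _ in range(iterations):
        estimates.append(math.ceil(phi(sigmas, eps)))  # Steps 2 and 3
        eps *= beta                                    # Step 4
    return estimates


# A rank-3 matrix with one small singular value (hypothetical data).
estimates = rank_estimates([3.0, 1.0, 0.1])
print(estimates)
```

For this data the rounded value underestimates the rank while $\varepsilon$ is large compared with the smallest squared singular value, and it settles at the exact rank 3 once $\varepsilon$ is small enough, in line with Corollary 2.2.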

The threshold $\varepsilon^*$ in Corollary 2.2 depends on X. This can be seen clearly from
the right-hand side of (2.3). However, the next theorem shows that over the optimal
solution set of (1.1) the approximation is uniform. Before stating this result, we
first show that the optimal solution set of (1.1) is closed. Note that, in general, the
set $\{X \in C : \operatorname{rank}(X) = r\}$ is not closed.

Lemma 2.3. Let C be a closed set in $R^{m\times n}$. Then the level set $\{X \in C :
\operatorname{rank}(X) \le r\}$ is closed for any given number $r \ge 0$. In particular, the optimal solution
set of (1.1), i.e., $C^* = \{X \in C : \operatorname{rank}(X) = r^*\}$, is closed, where $r^*$ $(= \min\{\operatorname{rank}(X) :
X \in C\})$ is the minimum rank.

Proof. Suppose that $\{X^k\} \subseteq \{X \in C : \operatorname{rank}(X) \le r\}$ is a sequence convergent to
$X^0$ in the sense that $\|X^k - X^0\| \to 0$ as $k \to \infty$. Let $r^0 = \operatorname{rank}(X^0)$ and $\sigma_1(X^0) \ge
\cdots \ge \sigma_{r^0}(X^0) > 0$ be the nonzero singular values of $X^0$. Note that the singular
values depend continuously on the entries of the matrix. This implies that for
sufficiently large k, $X^k$ has at least $r^0$ nonzero singular values. Thus $\operatorname{rank}(X^0) \le
\operatorname{rank}(X^k) \le r$ for all sufficiently large k. This together with the closedness of C
implies that $X^0 \in \{X \in C : \operatorname{rank}(X) \le r\}$, and thus the level set of rank(X) is
closed. In particular, the optimal solution set $\{X \in C : \operatorname{rank}(X) =
r^*\} = \{X \in C : \operatorname{rank}(X) \le r^*\}$ is closed. □

We now show that the function rank(X) can be uniformly approximated by $\phi_\varepsilon(X)$
over the optimal solution set of (1.1), in the sense that the right-hand side of (2.3) can be
bounded independently of the choice of $X^*$.

Theorem 2.4. If the optimal solution set of (1.1), denoted by $C^*$, is bounded,
then there exists a constant $\delta > 0$ such that for any given $\varepsilon > 0$ the inequality

$\phi_\varepsilon(X^*) \le \operatorname{rank}(X^*) \le \phi_\varepsilon(X^*) + \varepsilon \left( \frac{\min\{m, n\}}{\delta^2} \right)$

holds for all $X^* \in C^*$.

Proof. Let $r^*$ be the minimum rank of (1.1). Then $r^* = \operatorname{rank}(X^*)$ for all $X^* \in C^*$.
Let $\sigma_{r^*}(X^*)$ denote the smallest nonzero singular value of $X^*$, and denote

$\sigma_{\min} = \min\{\sigma_{r^*}(X^*) : X^* \in C^*\}.$

We now prove that $\sigma_{\min} > 0$. Indeed, if $\sigma_{\min} = 0$, then there exists a sequence
$\{X^*_k\} \subseteq C^*$ such that $\sigma_{r^*}(X^*_k) \to 0$. Since $C^*$ is bounded, passing to a subsequence
if necessary we may assume that $X^*_k \to \bar{X}$. Thus, $\sigma_{r^*}(\bar{X}) = 0$, which implies that
$\operatorname{rank}(\bar{X}) < r^*$, contradicting the closedness of $C^*$ (see Lemma 2.3). Therefore, we
have $\sigma_{\min} > 0$. Let $\delta > 0$ be a constant satisfying $\delta \le \sigma_{\min}$. By (2.3), we have

$\operatorname{rank}(X^*) - \phi_\varepsilon(X^*) \le \varepsilon \sum_{i=1}^{r^*} \frac{1}{(\sigma_i(X^*))^2} \le \varepsilon \, \frac{r^*}{\delta^2} \le \varepsilon \left( \frac{\min\{m, n\}}{\delta^2} \right),$

as desired. □
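
The uniform bound of Theorem 2.4 can be checked numerically on a small hypothetical example. In the sketch below, the "optimal set" consists of three rank-2 matrices of size 3 x 4, each represented only by its nonzero singular values; this data and the variable names are assumptions made for illustration.

```python
def phi(sigmas, eps):
    """phi_eps evaluated through formula (2.2)."""
    return sum(s * s / (s * s + eps) for s in sigmas)


m, n = 3, 4
r_star = 2
# Nonzero singular values of three hypothetical minimum rank solutions.
optimal_set = [[5.0, 0.5], [2.0, 1.0], [0.7, 0.6]]

# delta may be any positive number not exceeding the smallest singular value.
delta = min(min(sigmas) for sigmas in optimal_set)

worst_gaps = {}
for eps in (1.0, 0.1, 0.01):
    bound = eps * min(m, n) / delta ** 2
    worst = max(r_star - phi(sigmas, eps) for sigmas in optimal_set)
    worst_gaps[eps] = (worst, bound)
    print(eps, worst, bound)
```

The largest gap over the whole set stays below the single bound $\varepsilon \min\{m,n\}/\delta^2$, which is the point of the theorem: one constant $\delta$ serves every element of $C^*$.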

It is easy to see from (2.1) that $\phi_\varepsilon(X)$ is continuous with respect to $(X, \varepsilon)$ over
the set $R^{m\times n} \times (0, \infty)$. From Theorem 2.1 and Corollary 2.2, we see that the problem
(1.1) can be approximated by a continuous optimization problem with $\phi_\varepsilon$. In fact, by
replacing rank(X) by $\phi_\varepsilon(X)$, we obtain the following approximation problem of (1.1):

(2.4)  Minimize $\phi_\varepsilon(X) = \operatorname{tr}\left( X (X^T X + \varepsilon I)^{-1} X^T \right)$
       s.t. $X \in C$,

where $\varepsilon > 0$ is a given parameter. From an approximation point of view, some natural
questions arise: Does the optimal value (solution) of (2.4) converge to the minimum
rank (a minimum rank solution) of (1.1) as $\varepsilon \to 0$? How can we solve the problem (2.4)
efficiently, and when is this problem computationally tractable? The remainder of this
section and the next section are devoted to answering these questions.

For the convenience of the later analysis, we use the notation $\phi_0(X) = \operatorname{rank}(X)$.
Before we prove the main result of this section, let us first prove the semicontinuity
of the function $\phi_\varepsilon(X)$ at the boundary point $\varepsilon = 0$.

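Lemma 2.5. With respect to $(X, \varepsilon)$, the function $\phi_\varepsilon(X)$ is continuous
everywhere in the region $R^{m\times n} \times (0, \infty)$, and it is lower semicontinuous at $(\bar{X}, 0)$, i.e.,

$\liminf_{(X,\varepsilon) \to (\bar{X},0)} \phi_\varepsilon(X) \ge \phi_0(\bar{X}) = \operatorname{rank}(\bar{X}).$

Proof. The continuity of $\phi_\varepsilon$ in $R^{m\times n} \times (0, \infty)$ is obvious. We only need to
prove its lower semicontinuity at $(\bar{X}, 0)$. Let $\bar{X}$ be an arbitrary matrix in $R^{m\times n}$ with
$\operatorname{rank}(\bar{X}) = r$. Suppose that $X \to \bar{X}$. Then it is easy to see that

(2.5)  $(\sigma_1(X), \dots, \sigma_r(X)) \to (\sigma_1(\bar{X}), \dots, \sigma_r(\bar{X})) > 0,$

and $\sigma_i(X) \to 0$ for $i \ge r + 1$. This implies that $\operatorname{rank}(X) \ge \operatorname{rank}(\bar{X})$ as long as X is
sufficiently close to $\bar{X}$. By (2.2), we have

(2.6)  $\phi_\varepsilon(X) - \phi_0(\bar{X}) = \phi_\varepsilon(X) - \operatorname{rank}(\bar{X}) = \sum_{i=r+1}^{\operatorname{rank}(X)} \frac{(\sigma_i(X))^2}{(\sigma_i(X))^2 + \varepsilon} + \sum_{i=1}^{r} \left( \frac{(\sigma_i(X))^2}{(\sigma_i(X))^2 + \varepsilon} - 1 \right).$

It is not difficult to see that when $(X, \varepsilon) \to (\bar{X}, 0)$, the right-hand side of (2.6) does not
necessarily tend to zero, namely when $(\sigma_i(X))^2$ in the first term of the right-hand side of (2.6)
tends to zero no faster than $\varepsilon$. For instance, let $(\sigma_1(\bar{X}), \dots, \sigma_r(\bar{X})) = (1, \dots, 1)$,
and consider the sequence $X^k \to \bar{X}$ where $X^k$ satisfies $\operatorname{rank}(X^k) = p > r$,
$(\sigma_1(X^k), \dots, \sigma_r(X^k)) = (1, \dots, 1)$ and $(\sigma_{r+1}(X^k), \dots, \sigma_p(X^k)) = (1/k, \dots, 1/k)$. Setting
$\varepsilon_k = 1/k^2$ and substituting $(X^k, \varepsilon_k)$ into (2.6) yields

$\phi_{\varepsilon_k}(X^k) - \phi_0(\bar{X}) = \frac{1}{2}(p - r) - \frac{r}{k^2 + 1} \to \frac{1}{2}(p - r) > 0$ as $k \to \infty$.

So, $\phi_\varepsilon(X)$ is not necessarily continuous at $\varepsilon = 0$. However, from (2.6) we see that

$\phi_\varepsilon(X) - \phi_0(\bar{X}) \ge \sum_{i=1}^{r} \left( \frac{(\sigma_i(X))^2}{(\sigma_i(X))^2 + \varepsilon} - 1 \right),$

where $r = \operatorname{rank}(\bar{X})$. By (2.5), the right-hand side of the above goes to zero as $X \to \bar{X}$
and $\varepsilon \to 0$. It follows that

$\liminf_{(X,\varepsilon) \to (\bar{X},0)} \phi_\varepsilon(X) \ge \liminf_{(X,\varepsilon) \to (\bar{X},0)} \left( \phi_0(\bar{X}) + \sum_{i=1}^{r} \left( \frac{(\sigma_i(X))^2}{(\sigma_i(X))^2 + \varepsilon} - 1 \right) \right) = \phi_0(\bar{X}).$

The proof is complete. □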
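
The counterexample used in the proof can be reproduced numerically. The sketch below takes r = 2 and p = 5 (arbitrary illustrative choices), builds the singular values of $X^k$, sets $\varepsilon_k = 1/k^2$, and evaluates $\phi_{\varepsilon_k}(X^k) - \operatorname{rank}(\bar{X})$ through (2.2); the function names are hypothetical.

```python
def phi(sigmas, eps):
    """phi_eps evaluated through formula (2.2)."""
    return sum(s * s / (s * s + eps) for s in sigmas)


r, p = 2, 5  # rank of the limit matrix and rank of every X^k


def gap(k):
    """phi_{eps_k}(X^k) - rank(X_bar) for the sequence used in the proof."""
    sigmas_k = [1.0] * r + [1.0 / k] * (p - r)  # singular values of X^k
    eps_k = 1.0 / k ** 2
    return phi(sigmas_k, eps_k) - r


gaps = [gap(k) for k in (1, 10, 100, 1000)]
print(gaps)
```

The computed gaps approach (p - r)/2 = 1.5 rather than 0, so $\phi_\varepsilon$ is not continuous at $\varepsilon = 0$, while they remain nonnegative in the limit, as lower semicontinuity requires.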

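We now prove the main result of this section, which shows that rank minimization
over a bounded feasible set can be approximated by (2.4) to any level of
accuracy.

Theorem 2.6. Let C be a closed set in $R^{m\times n}$. Let $r^*$ be the minimum rank
of (1.1) and, for given $\varepsilon > 0$, let $\phi^*_\varepsilon$ and $X(\varepsilon)$ be the optimal value and an optimal
solution of (2.4), respectively. Then

(2.7)  $\phi^*_\varepsilon \le r^*$ for any $\varepsilon > 0$.

Moreover, when C is bounded,

(2.8)  $\lim_{\varepsilon \to 0} \phi^*_\varepsilon = r^*,$

and any accumulation point of $X(\varepsilon)$, as $\varepsilon \to 0$, is a minimum rank solution of (1.1).

Proof. Since $X(\varepsilon)$ is an optimal solution to (2.4), we have

$\phi^*_\varepsilon = \phi_\varepsilon(X(\varepsilon)) \le \phi_\varepsilon(X)$ for all $X \in C$.

In particular, any optimal solution $X^*$ of (1.1) satisfies the above inequality. So, $\phi^*_\varepsilon \le
\phi_\varepsilon(X^*) \le \operatorname{rank}(X^*) = r^*$, where the second inequality follows from (2.3), and $\varepsilon$ can
be any positive number. Thus (2.7) holds, and

(2.9)  $\limsup_{\varepsilon \to 0} \phi^*_\varepsilon \le r^*.$

On the other hand, since $\phi^*_\varepsilon \ge 0$, the number $\hat{r} = \liminf_{\varepsilon \to 0} \phi^*_\varepsilon$ is finite. Without loss
of generality, assume that the sequence $\{\phi^*_{\varepsilon_k}\}$, where $\varepsilon_k \to 0$ as $k \to \infty$, converges
to $\hat{r}$. Note that $X(\varepsilon_k)$ is a minimizer of (2.4) with $\varepsilon = \varepsilon_k$, i.e., $\phi^*_{\varepsilon_k} = \phi_{\varepsilon_k}(X(\varepsilon_k))$.
When C is bounded, the sequence $\{X(\varepsilon_k)\}$ is bounded. Passing to a subsequence if
necessary, we assume that $X(\varepsilon_k) \to X_0$ as $k \to \infty$. Clearly, $X_0 \in C$ since C is closed,
and hence $\operatorname{rank}(X_0) \ge r^*$. Therefore,

$\hat{r} = \lim_{k \to \infty} \phi^*_{\varepsilon_k} = \lim_{k \to \infty} \phi_{\varepsilon_k}(X(\varepsilon_k)) \ge \liminf_{(X,\varepsilon) \to (X_0,0)} \phi_\varepsilon(X) \ge \phi_0(X_0),$

where the last inequality follows from Lemma 2.5. Thus, $\hat{r} \ge \phi_0(X_0) = \operatorname{rank}(X_0) \ge
r^*$, which together with (2.9) implies (2.8).

We now prove that any accumulation point of $X(\varepsilon)$ is a minimum rank solution
of (1.1). Let $\tilde{X}$ (with $\operatorname{rank}(\tilde{X}) = \tilde{r}$) be an arbitrary accumulation point of $X(\varepsilon)$,
as $\varepsilon \to 0$. We show that $\tilde{r} = r^*$.
Consider a convergent sequence $X(\varepsilon_k) \to \tilde{X}$ where $\varepsilon_k \to 0$. Then by (2.7) and (2.2),
we have

$r^* \ge \phi^*_{\varepsilon_k} = \sum_{i=1}^{\operatorname{rank}(X(\varepsilon_k))} \frac{(\sigma_i(X(\varepsilon_k)))^2}{(\sigma_i(X(\varepsilon_k)))^2 + \varepsilon_k} \ge \sum_{i=1}^{\tilde{r}} \frac{(\sigma_i(X(\varepsilon_k)))^2}{(\sigma_i(X(\varepsilon_k)))^2 + \varepsilon_k} \to \sum_{i=1}^{\tilde{r}} \frac{(\sigma_i(\tilde{X}))^2}{(\sigma_i(\tilde{X}))^2} = \operatorname{rank}(\tilde{X}).$

Hence $r^* \ge \operatorname{rank}(\tilde{X})$. Since $\tilde{X} \in C$ (C is closed), we also have $\operatorname{rank}(\tilde{X}) \ge r^*$, so $\tilde{r} = r^*$.
Thus, any accumulation point of $X(\varepsilon)$ is a minimum rank solution to (1.1). □

Since $r^*$ is an integer, by (2.7) and (2.8), we immediately have the corollary below.

Corollary 2.7. Let $r^*$ and $\phi^*_\varepsilon$ be defined as in Theorem 2.6. If $C \subseteq R^{m\times n}$
is bounded and closed, then there exists a number $\delta > 0$ such that $r^* = \lceil \phi^*_\varepsilon \rceil$ for all
$\varepsilon \in (0, \delta]$.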
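
As a toy illustration of Theorem 2.6 and Corollary 2.7, consider a feasible set with only two elements (trivially closed and bounded), each described by its nonzero singular values. The set, the labels A and B, and the helper names are hypothetical; the sketch solves (2.4) by enumeration for a few values of $\varepsilon$.

```python
import math


def phi(sigmas, eps):
    """phi_eps evaluated through formula (2.2)."""
    return sum(s * s / (s * s + eps) for s in sigmas)


# Hypothetical feasible set C = {A, B}.
feasible = {
    "A": [0.1, 0.1, 0.1],  # rank 3, small singular values
    "B": [5.0, 4.0],       # rank 2, hence r* = 2
}


def solve_approximation(eps):
    """Return (label of X(eps), optimal value phi*_eps) for problem (2.4)."""
    label = min(feasible, key=lambda name: phi(feasible[name], eps))
    return label, phi(feasible[label], eps)


for eps in (1.0, 1e-2, 1e-4):
    label, value = solve_approximation(eps)
    print(eps, label, value, math.ceil(value))
```

For $\varepsilon = 1$ the minimizer of (2.4) is the rank-3 element A and the rounded optimal value is 1, below $r^*$. For $\varepsilon = 10^{-4}$ the minimizer is B and the rounded optimal value equals $r^* = 2$. In every case the optimal value does not exceed $r^*$, as (2.7) states.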

The results above provide a theoretical basis for developing new approximation
algorithms for rank minimization problems. Such an algorithm can be a heuristic
method for general rank minimization, and can be an exact method as indicated by
Corollary 2.7. From Theorem 2.6, the set $\{X(\varepsilon)\}$ can be viewed as a trajectory
leading to the minimum rank solution set of (1.1), and thus it is possible to construct a
continuation type method (e.g. a path-following method) for rank minimization
problems. In the next section, we discuss how and when the approximation
problem (2.4) can be dealt with efficiently from the viewpoint of computation. We
prove that under some conditions problem (2.4) can be either reformulated or relaxed
as a tractable optimization problem, typically an SDP problem.

3. Reformulation of the approximation problem (2.4). The main result in
the last section shows that if $\varepsilon$ is small enough, the optimal value of the rank minimization
problem can be obtained precisely by solving (2.4) just once, and the solution $X(\varepsilon)$ of
(2.4) is an approximation to the optimal solution of (1.1). If the problem (2.4) with
a prescribed $\varepsilon > 0$ fails to generate the minimum value of (1.1), we can reduce the
value of $\varepsilon$ and solve (2.4) again. By Corollary 2.7, the minimum rank of (1.1) can
be obtained by solving (2.4) a finite number of times. Thus, roughly speaking,
solving a rank minimization problem amounts to solving a continuous optimization
problem defined by (2.4).

In this section, we concentrate on the problem (2.4) to find out when and how
it can be solved efficiently. To this end, we investigate its equivalent formulations
together with some useful variants. In doing so, we take into account the structure
of C when necessary. Let us start with the reformulation of (2.4).

Introducing a variable $Y \in S^m$, we first note that (2.4) can be written as the
following nonlinear semidefinite programming problem:

(3.1)  Minimize $\{\operatorname{tr}(Y) : Y \succeq X (X^T X + \varepsilon I)^{-1} X^T, \ X \in C\}$.

It is easy to see that if $(Y^*, X^*)$ is an optimal solution to (3.1), then

(3.2)  $Y^* = X^* ((X^*)^T X^* + \varepsilon I)^{-1} (X^*)^T.$

Thus, we conclude that $X^*$ is an optimal solution to (2.4) if and only if $(Y^*, X^*)$ is an
optimal solution to (3.1), where $Y^*$ is given by (3.2). By the Schur complement theorem,
the problem (3.1) can be further written as

(3.3)  Minimize $\left\{ \operatorname{tr}(Y) : \begin{pmatrix} Y & X \\ X^T & X^T X + \varepsilon I \end{pmatrix} \succeq 0, \ X \in C \right\},$

which remains a nonlinear SDP problem. We now introduce the variable $Z = X^T X$,
which implies that Z is the optimal solution to the problem $\min_Z \{\operatorname{tr}(Z) : Z \succeq X^T X\}$.
By the Schur complement theorem again, $Z \succeq X^T X$ is nothing but $\begin{pmatrix} I & X \\ X^T & Z \end{pmatrix} \succeq 0$.
So the problem (2.4) can be written exactly as a bilevel SDP problem:

(3.4)  min $\operatorname{tr}(Y)$
       s.t. $\begin{pmatrix} Y & X \\ X^T & Z + \varepsilon I \end{pmatrix} \succeq 0, \ X \in C,$
            $Z = \arg\min_Z \left\{ \operatorname{tr}(Z) : \begin{pmatrix} I & X \\ X^T & Z \end{pmatrix} \succeq 0 \right\}.$

From the discussion above, we conclude that (2.4) is equivalent to the nonlinear SDP
problem (3.3), and is equivalent to the linear bilevel SDP problem (3.4). As a result,
by Theorem 2.6, the rank minimization over a bounded feasible set is equivalent to
the linear bilevel SDP problem of the form (3.4). Thus, the level of difficulty of
rank minimization can be understood from the perspective of its linear bilevel SDP
counterpart. It is worth mentioning that bilevel programming (in vector form) has
long been studied (e.g. [23]), but to our knowledge the bilevel SDP problem remains
a new topic so far. The analysis above shows that a bilevel SDP model does arise
from rank minimization. However, neither (3.3) nor (3.4) is a convex problem, and
hence they are not computationally tractable in general.

This motivates us to consider the next approximation model, which can be viewed
as a variant of (2.4). The difficulty of (3.3) and (3.4) lies in the hard equality $Z =
X^T X$. An immediate idea is to relax it to $Z \succeq X^T X$, yielding the problem:

Minimize $\left\{ \operatorname{tr}(Y) : \begin{pmatrix} Y & X \\ X^T & Z + \varepsilon I \end{pmatrix} \succeq 0, \ \begin{pmatrix} I & X \\ X^T & Z \end{pmatrix} \succeq 0, \ X \in C \right\},$

which is a convex problem if C is convex, and an SDP problem if C is defined by
(1.2) or (1.3). However, for any given $X \in C$ and any number $\beta > 0$, the point
$(Y = \beta I, Z = \alpha I)$ is feasible to the above problem when $\alpha > 0$ is sufficiently large.
So the optimal value of the above problem is always zero, providing no information about
the minimum rank of the original problem (1.1). This happens since Z gains too
much freedom when $Z = X^T X$ is relaxed to $Z \succeq X^T X$. Thus the value $\operatorname{tr}(Y) =
\operatorname{tr}(X (Z + \varepsilon I)^{-1} X^T)$ may deviate significantly from $\phi_\varepsilon(X)$ $(\approx \operatorname{rank}(X))$. To avoid this,
some driving force should be imposed on Z so that it is near (or equal) to $X^T X$.
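
The collapse of the relaxed problem can be seen already in the scalar case m = n = 1, which we use here purely as an illustration. With X fixed at a number x, the two semidefinite constraints reduce to $z \ge x^2$ and $y \ge x^2/(z + \varepsilon)$, so the smallest feasible y for a given z is $x^2/(z + \varepsilon)$. The function name and the numerical values below are assumptions.

```python
def smallest_feasible_y(x, z, eps):
    """Smallest y with [[y, x], [x, z + eps]] PSD, given z >= x^2 (scalar case)."""
    if z < x * x:
        raise ValueError("z must satisfy z >= x^2")
    return x * x / (z + eps)


x, eps = 2.0, 0.01
# With z = x^2 the value is phi_eps(x), which is close to rank(x) = 1.
exact = smallest_feasible_y(x, x * x, eps)
# Letting z = alpha grow drives the objective towards zero.
values = [smallest_feasible_y(x, alpha, eps) for alpha in (4.0, 1e2, 1e4, 1e6)]
print(exact, values)
```

The first value equals $\phi_\varepsilon(x)$, and the remaining values shrink towards zero as $\alpha$ grows, which is exactly the loss of information described above.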

Motivated by this observation, we consider the following problem in which a
'penalty' term is introduced into the objective:

(3.5)  Minimize $\operatorname{tr}(Y) + \frac{1}{\gamma} \operatorname{tr}(Z)$
       s.t. $\begin{pmatrix} Y & X \\ X^T & Z + \varepsilon I \end{pmatrix} \succeq 0, \ \begin{pmatrix} I & X \\ X^T & Z \end{pmatrix} \succeq 0, \ X \in C,$

where $\gamma$ is a positive number. The term $\frac{1}{\gamma}\operatorname{tr}(Z)$ acts as a penalty when $Z$ $(\succeq X^T X)$
deviates from $X^T X$. Since $\operatorname{tr}(Z) \ge \operatorname{tr}(X^T X) = \|X\|_F^2$, this term also drives
$\|X\|_F$ to be minimized. Note that when Z is driven near to $X^T X$, it is the first term
$\operatorname{tr}(Y)$ of the objective that approximates the rank of X, and returns the approximate
value of rank(X). The advantage of the approximation model (3.5) is that it is an
SDP problem when C is defined by linear constraints (such as (1.2) or (1.3)), and
hence it is computationally tractable. In what follows, we concentrate on this model
and prove that under some conditions the rank minimization can be approximated by
(3.5) to any level of accuracy.
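
The effect of the penalty term can again be sketched in the scalar case m = n = 1 with X fixed at a number x (an assumption made only for illustration). There (3.5) reduces to minimizing $f(z) = x^2/(z + \varepsilon) + z/\gamma$ over $z \ge x^2$. Since f is convex for $z > -\varepsilon$ with stationary point $|x|\sqrt{\gamma} - \varepsilon$, the minimizer is the larger of $x^2$ and that stationary point. The function name and the numbers are hypothetical.

```python
import math


def penalized_solution(x, eps, gamma):
    """Minimizer (y, z) of y + z / gamma subject to z >= x^2 and
    y >= x^2 / (z + eps), i.e. the scalar analogue of (3.5) with X fixed."""
    z_stationary = abs(x) * math.sqrt(gamma) - eps  # where f'(z) = 0
    z = max(x * x, z_stationary)                    # project onto z >= x^2
    y = x * x / (z + eps)
    return y, z


x, eps = 2.0, 0.01
y_weak, z_weak = penalized_solution(x, eps, gamma=100.0)       # weak penalty
y_strong, z_strong = penalized_solution(x, eps, gamma=eps**2)  # gamma / eps small
print(y_weak, z_weak)
print(y_strong, z_strong)
```

With the weak penalty ($\gamma = 100$) the optimal z exceeds $x^2$ and y falls far below $\phi_\varepsilon(x)$. With $\gamma = \varepsilon^2$ the optimal z equals $x^2$ and y equals $\phi_\varepsilon(x)$, which is the behaviour the penalty is meant to enforce.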

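Theorem 3.1. Let C be a bounded, closed set in $R^{m\times n}$. Suppose that C contains a
minimum rank element $X^*$ with the least F-norm, i.e., $\operatorname{rank}(X^*) \le \operatorname{rank}(X)$ and $\|X^*\|_F \le
\|X\|_F$ for all $X \in C$. Let $(Y_{\varepsilon,\gamma}, Z_{\varepsilon,\gamma}, X_{\varepsilon,\gamma})$ denote the optimal solution of the problem
(3.5). Then $\operatorname{tr}(Y_{\varepsilon,\gamma}) \le \phi_\varepsilon(X^*) \le \operatorname{rank}(X^*)$ for all $(\varepsilon, \gamma) > 0$, and

$\lim_{(\varepsilon,\gamma) \to 0, \ \gamma/\varepsilon \to 0} \operatorname{tr}(Y_{\varepsilon,\gamma}) = r^* = \operatorname{rank}(X^*), \qquad \lim_{(\varepsilon,\gamma) \to 0, \ \gamma/\varepsilon \to 0} \operatorname{tr}(Z_{\varepsilon,\gamma}) = \|X^*\|_F^2,$

and the sequence $\{X_{\varepsilon,\gamma}\}$ converges to the set of minimum rank solutions of (1.1), as
$(\varepsilon, \gamma) \to 0$ and $\gamma/\varepsilon \to 0$.

Proof. Since C is bounded and closed, the sequence $\{X_{\varepsilon,\gamma}\}$ has at least one
accumulation point, and any such accumulation point is in C. Let $X^0$ be an
arbitrary accumulation point of the sequence $\{X_{\varepsilon,\gamma}\}$ as $(\varepsilon, \gamma) \to 0$ and $\gamma/\varepsilon \to 0$. Without
loss of generality, we assume that $X_{\varepsilon,\gamma} \to X^0$ as $(\varepsilon, \gamma) \to 0$ and $\gamma/\varepsilon \to 0$. By the Schur
complement and the structure of the problem (3.5), it is easy to see that for any given
$\varepsilon, \gamma > 0$ the optimal solution $(Y_{\varepsilon,\gamma}, Z_{\varepsilon,\gamma}, X_{\varepsilon,\gamma})$ of (3.5) satisfies the following relation:

(3.6)  $Y_{\varepsilon,\gamma} = X_{\varepsilon,\gamma} (Z_{\varepsilon,\gamma} + \varepsilon I)^{-1} X_{\varepsilon,\gamma}^T, \qquad Z_{\varepsilon,\gamma} \succeq X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma}.$

Let $X^*$ be an arbitrary minimum rank solution of (1.1) with the least F-norm. Then
the point $(Y^*_\varepsilon, Z^*, X^*)$, where $Y^*_\varepsilon = X^* ((X^*)^T X^* + \varepsilon I)^{-1} (X^*)^T$ and $Z^* = (X^*)^T X^*$,
is feasible to the problem (3.5). By optimality, we have

(3.7)  $\operatorname{tr}(Y_{\varepsilon,\gamma}) + \frac{1}{\gamma} \operatorname{tr}(Z_{\varepsilon,\gamma}) \le \operatorname{tr}(Y^*_\varepsilon) + \frac{1}{\gamma} \operatorname{tr}(Z^*) = \phi_\varepsilon(X^*) + \frac{1}{\gamma} \|X^*\|_F^2.$

It follows from (3.6) that

(3.8)  $\operatorname{tr}(Z_{\varepsilon,\gamma}) \ge \operatorname{tr}(X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma}) = \|X_{\varepsilon,\gamma}\|_F^2 \ge \|X^*\|_F^2$ for all $\varepsilon, \gamma > 0$.

Combining (3.7) and (3.8) yields

(3.9)  $\operatorname{tr}(Y_{\varepsilon,\gamma}) \le \phi_\varepsilon(X^*),$

(3.10)  $0 \le \operatorname{tr}(Z_{\varepsilon,\gamma}) - \|X^*\|_F^2 \le \gamma \left( \phi_\varepsilon(X^*) - \operatorname{tr}(Y_{\varepsilon,\gamma}) \right) \le \gamma \phi_\varepsilon(X^*) \le \gamma \min\{m, n\},$

for all $(\varepsilon, \gamma) > 0$. The last inequality of (3.10) follows from $\phi_\varepsilon(X^*) \le \operatorname{rank}(X^*) \le
\min\{m, n\}$. Let $X_{\varepsilon,\gamma} = X^0 + \Delta_{\varepsilon,\gamma}$, where $\Delta_{\varepsilon,\gamma} \to 0$ since $X_{\varepsilon,\gamma} \to X^0$. Then

$X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} = (X^0)^T X^0 + \left( \Delta_{\varepsilon,\gamma}^T X^0 + (X^0)^T \Delta_{\varepsilon,\gamma} + \Delta_{\varepsilon,\gamma}^T \Delta_{\varepsilon,\gamma} \right) = (X^0)^T X^0 + G(\Delta_{\varepsilon,\gamma}),$

where $G(\Delta_{\varepsilon,\gamma}) = \Delta_{\varepsilon,\gamma}^T X^0 + (X^0)^T \Delta_{\varepsilon,\gamma} + \Delta_{\varepsilon,\gamma}^T \Delta_{\varepsilon,\gamma}$. Thus by (3.6), we have

(3.11)  $Z_{\varepsilon,\gamma} \succeq X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} = (X^0)^T X^0 + G(\Delta_{\varepsilon,\gamma}).$

Note that $\operatorname{tr}(G(\Delta_{\varepsilon,\gamma})) \to 0$ as $\Delta_{\varepsilon,\gamma} \to 0$. By (3.10) and (3.11), we have

$\|X^*\|_F^2 = \lim_{(\varepsilon,\gamma) \to 0, \ \gamma/\varepsilon \to 0} \operatorname{tr}(Z_{\varepsilon,\gamma}) \ge \lim_{(\varepsilon,\gamma) \to 0, \ \gamma/\varepsilon \to 0} \operatorname{tr}\left( (X^0)^T X^0 + G(\Delta_{\varepsilon,\gamma}) \right) = \|X^0\|_F^2.$

Thus, $X^0$ is a least F-norm element in C. On the other hand, from (3.11), we see that
$Z_{\varepsilon,\gamma} = X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \widetilde{\Delta}_{\varepsilon,\gamma}$ for some $\widetilde{\Delta}_{\varepsilon,\gamma} \succeq 0$. Thus, by (3.11) and (3.10) again, we have

$\|\widetilde{\Delta}_{\varepsilon,\gamma}\| \le \operatorname{tr}(\widetilde{\Delta}_{\varepsilon,\gamma}) = \operatorname{tr}(Z_{\varepsilon,\gamma}) - \|X_{\varepsilon,\gamma}\|_F^2 \le \operatorname{tr}(Z_{\varepsilon,\gamma}) - \|X^*\|_F^2 \le \gamma \min\{m, n\}.$

The first inequality above follows from the fact $\widetilde{\Delta}_{\varepsilon,\gamma} \succeq 0$, and the second follows from
$\|X_{\varepsilon,\gamma}\|_F^2 \ge \|X^*\|_F^2$. Therefore,

(3.12)  $\|\widetilde{\Delta}_{\varepsilon,\gamma}\| / \varepsilon \to 0$ as $(\varepsilon, \gamma) \to 0$ and $\gamma/\varepsilon \to 0$.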

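When $M \in R^{n\times n}$ and $\|M\| < 1$, it is well-known that $(I + M)^{-1} = I - M + M^2 -
M^3 + \cdots = I + \sum_{i=1}^{\infty} (-1)^i M^i$. Thus, for any $U, V \in R^{n\times n}$ where U is nonsingular, if
$\|V U^{-1}\| < 1$ we have

(3.13)  $(U + V)^{-1} = U^{-1} (I + V U^{-1})^{-1} = U^{-1} + U^{-1} \left[ \sum_{i=1}^{\infty} (-1)^i (V U^{-1})^i \right].$

As $(\varepsilon, \gamma) \to 0$ and $\gamma/\varepsilon \to 0$, it follows from (3.12) that the matrix
$\widetilde{\Delta}_{\varepsilon,\gamma} = Z_{\varepsilon,\gamma} - X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma}$ satisfies

$\|\widetilde{\Delta}_{\varepsilon,\gamma} (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1}\| \le \|\widetilde{\Delta}_{\varepsilon,\gamma}\| \, \|(X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1}\| \le \|\widetilde{\Delta}_{\varepsilon,\gamma}\| / \varepsilon \to 0.$

Thus, substituting $U = X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I$ and $V = \widetilde{\Delta}_{\varepsilon,\gamma}$ into (3.13) yields

$(X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \widetilde{\Delta}_{\varepsilon,\gamma} + \varepsilon I)^{-1} - (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} = (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \sum_{i=1}^{\infty} (-1)^i \left( \widetilde{\Delta}_{\varepsilon,\gamma} (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right)^i.$

Note that $\|(X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1}\| \le 1/\varepsilon$ and $\|I - \varepsilon (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1}\| \le 1$. When
$\|\widetilde{\Delta}_{\varepsilon,\gamma}\| / \varepsilon < 1$, we have

$\left\| X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} \left[ (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \widetilde{\Delta}_{\varepsilon,\gamma} + \varepsilon I)^{-1} - (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right] \right\|$
$= \left\| \left[ (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I) - \varepsilon I \right] \left[ (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \widetilde{\Delta}_{\varepsilon,\gamma} + \varepsilon I)^{-1} - (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right] \right\|$
$= \left\| \left( I - \varepsilon (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right) \sum_{i=1}^{\infty} (-1)^i \left( \widetilde{\Delta}_{\varepsilon,\gamma} (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right)^i \right\|$
$\le \left\| I - \varepsilon (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right\| \cdot \left\| \sum_{i=1}^{\infty} (-1)^i \left( \widetilde{\Delta}_{\varepsilon,\gamma} (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right)^i \right\|$
$\le \left\| \sum_{i=1}^{\infty} (-1)^i \left( \widetilde{\Delta}_{\varepsilon,\gamma} (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right)^i \right\|$
$\le \sum_{i=1}^{\infty} \left\| \widetilde{\Delta}_{\varepsilon,\gamma} (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right\|^i \le \sum_{i=1}^{\infty} \left( \|\widetilde{\Delta}_{\varepsilon,\gamma}\| / \varepsilon \right)^i = \frac{\|\widetilde{\Delta}_{\varepsilon,\gamma}\| / \varepsilon}{1 - \|\widetilde{\Delta}_{\varepsilon,\gamma}\| / \varepsilon}.$

Thus, by (3.12), we have

(3.14)  $\operatorname{tr}\left( X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} \left[ (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \widetilde{\Delta}_{\varepsilon,\gamma} + \varepsilon I)^{-1} - (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right] \right) \to 0$

as $(\varepsilon, \gamma) \to 0$ and $\gamma/\varepsilon \to 0$. By (3.9) and (3.6), we have

$\operatorname{rank}(X^*) \ge \phi_\varepsilon(X^*) \ge \operatorname{tr}(Y_{\varepsilon,\gamma}) = \operatorname{tr}\left( X_{\varepsilon,\gamma} (Z_{\varepsilon,\gamma} + \varepsilon I)^{-1} X_{\varepsilon,\gamma}^T \right)$
$= \operatorname{tr}\left( X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \widetilde{\Delta}_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right)$
$= \operatorname{tr}\left( X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} \left[ (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \widetilde{\Delta}_{\varepsilon,\gamma} + \varepsilon I)^{-1} - (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right] \right) + \operatorname{tr}\left( X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right)$
$= \operatorname{tr}\left( X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} \left[ (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \widetilde{\Delta}_{\varepsilon,\gamma} + \varepsilon I)^{-1} - (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right] \right) + \phi_\varepsilon(X_{\varepsilon,\gamma}),$

which together with (3.14) and Lemma 2.5 implies that

$\operatorname{rank}(X^*) \ge \limsup_{(\varepsilon,\gamma) \to 0, \ \gamma/\varepsilon \to 0} \operatorname{tr}(Y_{\varepsilon,\gamma}) \ge \liminf_{(\varepsilon,\gamma) \to 0, \ \gamma/\varepsilon \to 0} \operatorname{tr}(Y_{\varepsilon,\gamma})$
$= \liminf_{(\varepsilon,\gamma) \to 0, \ \gamma/\varepsilon \to 0} \left\{ \operatorname{tr}\left( X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} \left[ (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \widetilde{\Delta}_{\varepsilon,\gamma} + \varepsilon I)^{-1} - (X_{\varepsilon,\gamma}^T X_{\varepsilon,\gamma} + \varepsilon I)^{-1} \right] \right) + \phi_\varepsilon(X_{\varepsilon,\gamma}) \right\}$
$= \liminf_{(\varepsilon,\gamma) \to 0, \ \gamma/\varepsilon \to 0} \phi_\varepsilon(X_{\varepsilon,\gamma}) \ge \phi_0(X^0) = \operatorname{rank}(X^0).$

Since $X^*$ is a minimum rank solution, all inequalities above must be equalities, and
thus $X^0$ is a minimum rank solution, and $\lim_{(\varepsilon,\gamma) \to 0, \ \gamma/\varepsilon \to 0} \operatorname{tr}(Y_{\varepsilon,\gamma}) = \operatorname{rank}(X^*)$. □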

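By Theorem 3.1, we may simply set $\gamma = \gamma(\varepsilon)$ as a function of $\varepsilon$, for instance,
$\gamma = \varepsilon^p$ where $p > 1$ is a constant. Then (3.5) becomes the problem below:

(3.15)  Minimize $\operatorname{tr}(Y) + \frac{1}{\gamma(\varepsilon)} \operatorname{tr}(Z)$
        s.t. $\begin{pmatrix} Y & X \\ X^T & Z + \varepsilon I \end{pmatrix} \succeq 0, \ \begin{pmatrix} I & X \\ X^T & Z \end{pmatrix} \succeq 0, \ X \in C,$

which includes only one parameter. An immediate corollary of Theorem 3.1 is
given as follows, which shows that the minimum rank of (1.1) can be obtained exactly
by solving (3.15) with a suitably chosen parameter $\varepsilon$.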

Corollary 3.2. Let $C \subseteq R^{m\times n}$ be a bounded and closed set, containing an
element $X^*$ with the minimum rank $r^* = \operatorname{rank}(X^*)$ and the least F-norm. Let $\gamma :
(0, \infty) \to (0, \infty)$ be a function satisfying $\gamma(\varepsilon)/\varepsilon \to 0$ as $\varepsilon \to 0$. If $(Y_\varepsilon, Z_\varepsilon, X_\varepsilon)$ is the
optimal solution of (3.15), then $\operatorname{tr}(Y_\varepsilon) \le r^*$ for all $\varepsilon$, and

$\lim_{\varepsilon \to 0} \operatorname{tr}(Y_\varepsilon) = r^*, \qquad \lim_{\varepsilon \to 0} \operatorname{tr}(Z_\varepsilon) = \|X^*\|_F^2,$

and the sequence $\{X_\varepsilon\}$ converges to the set of minimum rank solutions of (1.1).
Moreover, there exists a threshold $\delta > 0$ such that $r^* = \lceil \operatorname{tr}(Y_\varepsilon) \rceil$ for every $\varepsilon \in (0, \delta]$.

From the above results, we see that a rank minimization problem can be tractable
under some conditions. We summarize this result as follows.

Corollary 3.3. When C is defined by linear constraints (such as (1.2) and
(1.3)), and if C contains a minimum rank element with the least F-norm, the rank
minimization problem (1.1) is equivalent to the SDP problem (3.5) by a suitable choice
of the parameter $(\gamma, \varepsilon)$.

Note that the first term of the objective of (3.5) estimates rank(X) and the
second term measures the least F-norm. So from Theorem 3.1 we may roughly say
that under some conditions minimizing rank(X) over C is equivalent to minimizing
$\operatorname{rank}(X) + \beta \|X\|_F^2$ over C for some $\beta$. This is true, as shown by the next result.

Theorem 3.4. Let the feasible set be of the form $C = \mathcal{F} \cap \{X : \gamma_1 \le \|X\|_F^2 \le \gamma_2\}$,
where $0 < \gamma_1 \le \gamma_2$ are constants and $\mathcal{F} \subseteq R^{m\times n}$ is a closed set.

(i) The following two problems are equivalent in the sense that they yield the same
minimum rank solution:

(3.16)  Minimize $\{\operatorname{rank}(X) : X \in C = \mathcal{F} \cap \{X : \gamma_1 \le \|X\|_F^2 \le \gamma_2\}\}$,

(3.17)  Minimize $\{\operatorname{rank}(X) + (1/\eta) \|X\|_F^2 : X \in C = \mathcal{F} \cap \{X : \gamma_1 \le \|X\|_F^2 \le \gamma_2\}\}$,

where $\eta > \gamma_2 - \gamma_1$ is a given number.

(ii) If $\mathcal{F}$ is a cone, then the set C contains a minimum rank matrix $X^*$ with the
least F-norm.

Proof. (i) Assume that $X^*$ is a minimizer of (3.16) with the minimum rank $r^*$,
and assume that $\hat{X}$ is an arbitrary minimizer of the problem (3.17). We show that
$\operatorname{rank}(\hat{X}) = r^*$. In fact, if this is not true, then $\operatorname{rank}(\hat{X}) \ge r^* + 1$, and thus

$\operatorname{rank}(\hat{X}) + (1/\eta) \|\hat{X}\|_F^2 \ge \operatorname{rank}(X^*) + 1 + (1/\eta) \|\hat{X}\|_F^2$
$= \operatorname{rank}(X^*) + (1/\eta) \|X^*\|_F^2 + 1 + (1/\eta) \left( \|\hat{X}\|_F^2 - \|X^*\|_F^2 \right)$
(3.18)  $> \operatorname{rank}(X^*) + (1/\eta) \|X^*\|_F^2,$

where the last inequality above follows from the fact $X^*, \hat{X} \in \{X : \gamma_1 \le \|X\|_F^2 \le \gamma_2\}$,
which implies that $1 + (1/\eta) \left( \|\hat{X}\|_F^2 - \|X^*\|_F^2 \right) \ge 1 + (1/\eta)(\gamma_1 - \gamma_2) > 0$ by the choice
of $\eta$. Thus, (3.18) contradicts the fact that $\hat{X}$ is a minimizer of (3.17).

(ii) Suppose that $\mathcal{F}$ is a cone. Consider the F-norm minimization problem:

Minimize $\{\|X\|_F^2 : X \in C = \mathcal{F} \cap \{X : \gamma_1 \le \|X\|_F^2 \le \gamma_2\}\}$.

Since the feasible set of the problem is closed and bounded, the least F-norm solution,
denoted by $\hat{X}$, exists. Let $X^*$ be a minimum rank element in C. Then $\|X^*\|_F^2 \ge
\|\hat{X}\|_F^2 \ge \gamma_1 > 0$. Thus, there is a number $\alpha$ with $1 \ge \alpha > 0$ such that $\alpha \|X^*\|_F =
\|\hat{X}\|_F$. Note that $\alpha X^* \in \mathcal{F}$ (since $\mathcal{F}$ is a cone), and that $\operatorname{rank}(\alpha X^*) = \operatorname{rank}(X^*)$.
Thus, $\alpha X^*$ is a minimum rank matrix with the least F-norm in C. □
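
Part (i) can be illustrated on a hypothetical two-element feasible set, assuming (as the proof requires) that the bounds $\gamma_1, \gamma_2$ apply to the squared F-norm. Each candidate is described only by its rank and its squared F-norm; the labels P and Q, the data, and the function name are assumptions made for this sketch.

```python
# (label, rank, squared Frobenius norm); both satisfy gamma1 <= ||X||_F^2 <= gamma2.
gamma1, gamma2 = 1.0, 3.0
candidates = [("P", 1, 3.0), ("Q", 2, 1.0)]


def minimizer_of_penalized_problem(eta):
    """Solve (3.17) by enumeration: minimize rank(X) + ||X||_F^2 / eta."""
    return min(candidates, key=lambda c: c[1] + c[2] / eta)


r_star = min(rank for _, rank, _ in candidates)

# eta > gamma2 - gamma1: the minimizer of (3.17) has minimum rank.
good = minimizer_of_penalized_problem(eta=2.5)
# eta <= gamma2 - gamma1: the guarantee is lost and a higher-rank element can win.
bad = minimizer_of_penalized_problem(eta=1.0)
print(good, bad, r_star)
```

With $\eta = 2.5 > \gamma_2 - \gamma_1$ the penalized problem selects the rank-1 element P, whereas with $\eta = 1$ it selects the rank-2 element Q, which shows that the condition on $\eta$ cannot simply be dropped.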

Before we close this section, let us make some further comments on the situation 
where $C$ is the intersection of a cone and a bounded set defined by a matrix norm, 
as discussed in Theorem 3.4. This situation does arise in the study of quadratic 
(in)equality systems and quadratic optimization. First of all, it is worth pointing out 
the following fact. Its proof is evident and omitted. 

Theorem 3.5. Let $\mathcal{F}$ be a cone in $R^{m\times n}$, and let $0 < \gamma_1 \le \gamma_2$ be two positive 
numbers. Then the minimum rank $r^*$ of the rank minimization problem 

$$r^* = \min \{\operatorname{rank}(X) : X \in C = \mathcal{F} \cap \{X : \gamma_1 \le \|X\| \le \gamma_2\}\} \tag{3.19}$$

is independent of the choice of $\gamma_1, \gamma_2$ and the norm $\|\cdot\|$. 

In other words, no matter which matrix norm and which positive numbers $\gamma_1, \gamma_2$ 
are used, the problem of the form (3.19) yields the same minimum rank. So, in theory, 
all these rank minimization problems are equivalent. From a computational point of 
view, however, the choice of the norm $\|\cdot\|$ does matter. For instance, when $\mathcal{F}$ is a 
subset of the positive semidefinite cone, there are some benefits of using the nuclear 
norm in (3.19). Since $\|X\|_* = \operatorname{tr}(X)$ on the positive semidefinite cone, the constraint 
$\gamma_1 \le \|X\|_* \le \gamma_2$ in this case coincides with the linear constraint $\gamma_1 \le \operatorname{tr}(X) \le \gamma_2$. As 
a result, the approximation counterpart, defined by (3.5), of the problem (3.19) is an 
SDP problem in this case, and hence it can be solved efficiently. However, when the 
nuclear norm is used in (3.19), the problem (3.19) may not satisfy the condition of 
Theorem 3.1. 
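
The identity $\|X\|_* = \operatorname{tr}(X)$ on the positive semidefinite cone, which turns the norm constraint into a linear one, is easy to confirm numerically. The sketch below uses NumPy with randomly generated, purely illustrative data; it also shows that the identity fails for a symmetric matrix outside the cone.

```python
import numpy as np

rng = np.random.default_rng(0)
B = rng.standard_normal((4, 2))
X = B @ B.T                               # positive semidefinite, rank at most 2

nuclear_norm = np.linalg.norm(X, "nuc")   # sum of the singular values of X
trace = np.trace(X)                       # sum of the eigenvalues of X

# A symmetric matrix that is not positive semidefinite: the two quantities differ.
S = np.diag([1.0, -1.0])
nuclear_S = np.linalg.norm(S, "nuc")
trace_S = np.trace(S)

print(nuclear_norm, trace, nuclear_S, trace_S)
```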

When $C$ is defined by a cone, Theorem 3.4(ii) shows that the problem (3.19) satisfies 
the condition of Theorem 3.1. However, when the F-norm is used, the problem (3.5) is 
not convex in general. To handle this nonconvexity, we may consider a relaxation of 
(3.19). For instance, when $\mathcal{F}$ in (3.19) is a cone contained in the positive semidefinite 
cone, we define 

$$\begin{cases}
\delta_1 = \min\{\operatorname{tr}(X) : \gamma_1 \le \|X\|_F \le \gamma_2,\ X \succeq 0\}, \\
\delta_2 = \max\{\operatorname{tr}(X) : \gamma_1 \le \|X\|_F \le \gamma_2,\ X \succeq 0\},
\end{cases} \tag{3.20}$$

where $\gamma_1 > 0$. Clearly, $\delta_1$ and $\delta_2$ exist and are positive. Thus the problem (3.19) is 
relaxed to 

$$l^* = \min\{\operatorname{rank}(X) : X \in C = \mathcal{F} \cap \{X : \delta_1 \le \operatorname{tr}(X) \le \delta_2\}\}.$$

When $\mathcal{F}$ is defined by linear constraints, the approximation counterpart (3.5) of this 
relaxation problem is an SDP problem. Denote the optimal solution of this SDP 
problem by $(Y_{\varepsilon,\eta}, Z_{\varepsilon,\eta}, X_{\varepsilon,\eta})$. Then by Theorem 3.1 it provides a lower bound for the 
minimum rank of the above relaxation problem, and hence a lower bound for the 
minimum rank of the original problem (3.19), i.e., $\operatorname{tr}(Y_{\varepsilon,\eta}) \le l^* \le r^*$. 

4. Application to the system of quadratic equations. Given a finite number 
of matrices $A_i \in S^n$, $i = 1, \dots, m$, we consider the development of sufficient conditions 
for the following assertion: 

$$x^T A_i x = 0,\ i = 1, \dots, m \ \Longrightarrow\ x = 0, \tag{4.1}$$

i.e., 0 is the only solution to (1.5). At first glance, it seems that (1.5) and (4.1) 
have nothing to do with a rank minimization problem. In this section, however, 
we show that (4.1) can be equivalently formulated as a rank minimization problem, 
based on which we may derive some sufficient conditions for (4.1) by applying the 
approximation theory developed in previous sections. Note that system (1.5) can be 
written as $\langle A_i, xx^T\rangle = 0$, $i = 1, \dots, m$. Since $X = xx^T$ is either 0 (when $x = 0$) or 
a positive semidefinite rank-one matrix (when $x \ne 0$), it is natural to consider the 
linear system: 

$$\langle A_i, X\rangle = 0,\ i = 1, \dots, m,\quad X \succeq 0, \tag{4.2}$$

which is a homogeneous system. The set $\{X : \langle A_i, X\rangle = 0,\ i = 1, \dots, m,\ X \succeq 0\}$ is a 
convex cone. It is evident that the system (1.5) has a nonzero solution if and only if 
the system (4.2) has a rank-one solution. In other words, 0 is the only solution to 
(1.5) if and only if (4.2) has no rank-one solution. There are only two cases for the 
system (4.2) with no rank-one solution: either $X = 0$ is the only matrix satisfying 
(4.2) or the minimum rank of the nonzero matrices satisfying (4.2) is greater than or 
equal to 2. As a result, let us consider the following rank minimization problem: 

$$r^* = \min\{\operatorname{rank}(X) : \langle A_i, X\rangle = 0,\ i = 1, \dots, m,\ \delta_1 \le \|X\| \le \delta_2,\ X \succeq 0\}, \tag{4.3}$$

where $0 < \delta_1 \le \delta_2$ are two given positive constants. Clearly, $X = 0$ is the only matrix 
satisfying (4.2) if and only if the problem (4.3) is infeasible, in which case we set 
$r^* = \infty$. It is also easy to see that system (4.2) has a solution $X \ne 0$ if and only if 
the problem (4.3) is feasible, in which case $r^*$ is finite and $1 \le r^* \le n$. Thus for the 
problem (4.3), we have either $r^* = \infty$ or $1 \le r^* \le n$. 
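
The lifting $X = xx^T$ behind (4.2) and (4.3) can be illustrated with a small numerical sketch. The matrix `A` and the vector `x` below are hypothetical data for a single quadratic equation, and `inner` is a helper introduced here for the matrix inner product; the code checks that $x^T A x = \langle A, xx^T\rangle$ and that the lifted matrix is a positive semidefinite rank-one solution of the linear system.

```python
import numpy as np

def inner(A, B):
    """Matrix inner product <A, B> = tr(A^T B)."""
    return float(np.trace(A.T @ B))

# Hypothetical data for a single quadratic equation x^T A x = 0.
A = np.array([[1.0, 0.0], [0.0, -1.0]])
x = np.array([1.0, 1.0])          # a nonzero solution: 1 - 1 = 0

X = np.outer(x, x)                # lifted variable X = x x^T
quad_value = float(x @ A @ x)     # value of the quadratic form
lifted_value = inner(A, X)        # value of the linear form in X
rank_X = np.linalg.matrix_rank(X)
min_eig_X = np.linalg.eigvalsh(X).min()

print(quad_value, lifted_value, rank_X)
```

In this example the quadratic system has a nonzero solution, so the lifted system has a rank-one solution and the minimum rank in (4.3) would be 1.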

From the above discussion, we immediately have the following result. 

Lemma 4.1. 0 is the only solution to system (1.5) if and only if $r^* \ge 2$, where 
$r^*$ is the minimum rank of (4.3). 

Thus developing a sufficient condition for (4.1) can be achieved by identifying the 
condition under which the minimum rank of (4.3) is greater than or equal to 2. We 
follow this idea to establish some sufficient conditions for (4.1). By Theorem 3.5, the 
optimal value $r^*$ of (4.3) is independent of the choice of $\delta_1, \delta_2$ and $\|\cdot\|$. Thus Lemma 
4.1 holds for any given $0 < \delta_1 \le \delta_2$ and any prescribed matrix norm in (4.3). So we 
are free to choose $\delta_1, \delta_2$ and the matrix norm in (4.3) without affecting the 
value of $r^*$. Thus, by setting $\delta_1 = \delta_2 = 1$ for simplicity and using the F-norm 
in (4.3), we have the problem 

$$r^* = \min\{\operatorname{rank}(X) : \langle A_i, X\rangle = 0,\ i = 1, \dots, m,\ \|X\|_F = 1,\ X \succeq 0\}. \tag{4.4}$$

By Theorem 3.4(ii), the feasible set of this problem contains a minimum rank solution 
with the least F-norm (which is equal to 1 in this case). From Theorem 3.1 
and its corollary, the rank minimization problem (4.4) can be approximated by the following 
continuous optimization problem (as $(\eta, \varepsilon) \to 0$ and $\eta/\varepsilon \to 0$): 

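$$\begin{array}{ll}
\text{Minimize} & \operatorname{tr}(Y) + (1/\eta)\operatorname{tr}(Z) \\
\text{s.t.} & \begin{pmatrix} Y & X \\ X^T & Z + \varepsilon I \end{pmatrix} \succeq 0,\quad \begin{pmatrix} I & X \\ X^T & Z \end{pmatrix} \succeq 0, \\
& \langle A_i, X\rangle = 0,\ i = 1, \dots, m,\quad \|X\|_F = 1,\quad X \succeq 0.
\end{array} \tag{4.5}$$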

(All results later in this section can be stated without involving the parameter $\eta$ by 
setting, for instance, $\eta = \varepsilon^2$ for simplicity.) By Corollary 3.2, the first term of 
the objective in the above problem provides a lower bound for the minimum rank 
of (4.4). However, the constraint $\|X\|_F = 1$ makes the problem (4.5) difficult to 
solve directly. So let us consider a relaxation of this constraint. Similar to (3.20), 
we define two constants: 

$$\delta_1 = \min\{\operatorname{tr}(X) : \|X\|_F = 1,\ X \succeq 0\},\quad \delta_2 = \max\{\operatorname{tr}(X) : \|X\|_F = 1,\ X \succeq 0\}. \tag{4.6}$$

It is easy to verify that $\delta_1 = 1$ and $\delta_2 = \sqrt{n}$. In fact, in terms of the eigenvalues of $X$, the 
above two extreme problems are nothing but minimizing and maximizing, respectively, 
the function $\sum_{i=1}^n \lambda_i$ subject to $\sum_{i=1}^n \lambda_i^2 = 1$, $\lambda_i \ge 0$, $i = 1, \dots, n$. The optimal values 
of these two problems are 1 and $\sqrt{n}$, respectively. Therefore, we conclude that 

$$\{X : \|X\|_F = 1,\ X \succeq 0\} \subseteq \{X : 1 \le \operatorname{tr}(X) \le \sqrt{n},\ X \succeq 0\}.$$
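
The bounds $\delta_1 = 1$ and $\delta_2 = \sqrt{n}$ can be sanity-checked numerically. The sketch below (NumPy; the dimension, the seed and the number of samples are arbitrary illustrative choices) samples positive semidefinite matrices with unit F-norm, records their traces, and evaluates the two extreme points: a rank-one matrix attaining trace 1 and the scaled identity attaining trace $\sqrt{n}$.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 5

traces = []
for _ in range(200):
    k = int(rng.integers(1, n + 1))        # random number of columns, hence random rank
    B = rng.standard_normal((n, k))
    X = B @ B.T                            # positive semidefinite
    X = X / np.linalg.norm(X, "fro")       # normalize to unit F-norm
    traces.append(float(np.trace(X)))

# Extreme points of the trace over {X : ||X||_F = 1, X PSD}.
X_min = np.zeros((n, n))
X_min[0, 0] = 1.0                          # rank one: trace equals 1
X_max = np.eye(n) / np.sqrt(n)             # all eigenvalues equal: trace equals sqrt(n)

print(min(traces), max(traces), np.trace(X_min), np.trace(X_max))
```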

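Thus, the following SDP problem is a relaxation of (4.5): 

$$\begin{array}{ll}
\text{Minimize} & \operatorname{tr}(Y) + (1/\eta)\operatorname{tr}(Z) \\
\text{s.t.} & \begin{pmatrix} Y & X \\ X^T & Z + \varepsilon I \end{pmatrix} \succeq 0,\quad \begin{pmatrix} I & X \\ X^T & Z \end{pmatrix} \succeq 0, \\
& \langle A_i, X\rangle = 0,\ i = 1, \dots, m,\quad 1 \le \operatorname{tr}(X) \le \sqrt{n},\quad X \succeq 0.
\end{array} \tag{4.7}$$

The optimal value of (4.7) is a lower bound for that of (4.5). We now derive the 
dual problem of (4.7), which will be used to develop the sufficient condition for (4.1). 
We will make use of the following lemma. 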

Lemma 4.2. If the SDP problem is of the form 

$$\min\{\langle C, W\rangle : \langle C_i, W\rangle = b_i,\ i = 1, \dots, l,\ \delta_1 \le \langle C_0, W\rangle \le \delta_2,\ W \succeq 0\}, \tag{4.8}$$

then its dual problem is given by 

$$\max\left\{b^T y + \delta_1 t_1 + \delta_2 t_2 : \sum_{i=1}^{l} y_i C_i + (t_1 + t_2) C_0 \preceq C,\ t_1 \ge 0,\ t_2 \le 0\right\}, \tag{4.9}$$

where $b = (b_1, \dots, b_l)^T$. 

Proof. For the standard SDP problem $\min\{\langle P, W\rangle : \langle P_i, W\rangle = b_i,\ i = 1, \dots, l,\ W \succeq 0\}$, 
it is well known that its dual problem is given by $\max\{b^T y : \sum_{i=1}^{l} y_i P_i \preceq P\}$. 
By transforming (4.8) into the standard form and applying this 
fact, it is easy to verify that the dual problem of (4.8) is indeed given by (4.9). The 
details are omitted. □ 
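
Although the derivation is omitted, the weak duality relation between (4.8) and (4.9) is easy to test numerically: for a primal feasible $W$ and a dual feasible $(y, t_1, t_2)$ one has $\langle C, W\rangle \ge b^T y + \delta_1 t_1 + \delta_2 t_2$. The sketch below uses NumPy and builds a feasible pair by construction; all data, sizes and helper names (`sym`, `inner`, `random_psd`) are hypothetical and chosen only for illustration, and no SDP solver is involved.

```python
import numpy as np

rng = np.random.default_rng(3)
N, l = 4, 2                                   # matrix size and number of equalities

def sym(M):
    return (M + M.T) / 2

def inner(A, B):
    return float(np.trace(A @ B))             # <A, B> for symmetric A, B

def random_psd(size):
    B = rng.standard_normal((size, size))
    return B @ B.T

C_list = [sym(rng.standard_normal((N, N))) for _ in range(l)]
C0 = sym(rng.standard_normal((N, N)))

# Primal point: pick W PSD, then define b and the bounds so that W is feasible.
W = random_psd(N)
b = np.array([inner(Ci, W) for Ci in C_list])
c0 = inner(C0, W)
delta1, delta2 = c0 - 1.0, c0 + 2.0           # delta1 <= <C0, W> <= delta2

# Dual point: pick (y, t1, t2) with t1 >= 0 >= t2, then define C so that
# sum_i y_i C_i + (t1 + t2) C0 + S = C with a PSD slack S.
y = rng.standard_normal(l)
t1, t2 = 0.7, -0.3
S = random_psd(N)
C = sum(yi * Ci for yi, Ci in zip(y, C_list)) + (t1 + t2) * C0 + S

primal_value = inner(C, W)
dual_value = float(b @ y) + delta1 * t1 + delta2 * t2
gap = primal_value - dual_value               # equals <S, W> + t1*(c0-delta1) + t2*(c0-delta2)

print(primal_value, dual_value, gap)
```

The gap splits into $\langle S, W\rangle \ge 0$ and the two sign-constrained terms, which is exactly why $t_1 \ge 0$ and $t_2 \le 0$ appear in (4.9).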

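To obtain the dual problem of (4.7), let us rewrite the problem (4.7) in the form 
of (4.8). Notice that the positive semidefinite conditions in (4.7) are equivalent to 

$$\begin{pmatrix}
X & 0 & 0 & 0 & 0 \\
0 & I & X & 0 & 0 \\
0 & X & Z & 0 & 0 \\
0 & 0 & 0 & Y & X \\
0 & 0 & 0 & X & Z + \varepsilon I
\end{pmatrix} \succeq 0. \tag{4.10}$$

Let $E^{(k,l)} \in S^{5n\times 5n}$ $(k, l = 1, \dots, 5n)$ denote the symmetric matrices with $(k,l)$th 
entry = $(l,k)$th entry = 1 and zero elsewhere. When $k = l$, $E^{(k,k)}$ denotes the matrix 
with $(k,k)$th entry 1 and all other elements 0. Clearly, we have $E^{(k,l)} = E^{(l,k)}$ for 
any $(k,l)$. Note that any matrix $W = (w_{ij}) \in S^{5n\times 5n}$ can be represented as 
$W = \sum_{k=1}^{5n}\sum_{l=k}^{5n} w_{k,l}E^{(k,l)}$, and $\langle E^{(k,l)}, W\rangle = w_{k,l} + w_{l,k} = 2w_{k,l}$ for $k \ne l$, and 
$\langle E^{(k,k)}, W\rangle = w_{k,k}$. In terms of $E^{(k,l)}$, the condition (4.10) can be written as the 
following set of constraints: 

$$\langle E^{(i,\,n+j)}, W\rangle = 0,\quad i = 1, \dots, n,\ j = 1, \dots, 4n, \tag{4.11}$$
$$\langle E^{(n+i,\,3n+j)}, W\rangle = 0,\quad i, j = 1, \dots, 2n, \tag{4.12}$$
$$\langle E^{(n+i,\,n+j)}, W\rangle = 0,\quad i = 1, \dots, n-1,\ j = i+1, \dots, n, \tag{4.13}$$
$$\langle E^{(n+i,\,n+i)}, W\rangle = 1,\quad i = 1, \dots, n, \tag{4.14}$$
$$\langle E^{(i,j)} - E^{(n+i,\,2n+j)}, W\rangle = 0,\quad i = 1, \dots, n-1,\ j = i+1, \dots, n, \tag{4.15}$$
$$\langle E^{(i,j)} - E^{(n+j,\,2n+i)}, W\rangle = 0,\quad i = 1, \dots, n-1,\ j = i+1, \dots, n, \tag{4.16}$$
$$\langle 2E^{(i,i)} - E^{(n+i,\,2n+i)}, W\rangle = 0,\quad i = 1, \dots, n, \tag{4.17}$$
$$\langle E^{(n+i,\,2n+j)} - E^{(3n+i,\,4n+j)}, W\rangle = 0,\quad i, j = 1, \dots, n, \tag{4.18}$$
$$\langle E^{(4n+i,\,4n+j)} - E^{(2n+i,\,2n+j)}, W\rangle = 0,\quad i = 1, \dots, n-1,\ j = i+1, \dots, n, \tag{4.19}$$
$$\langle E^{(4n+i,\,4n+i)} - E^{(2n+i,\,2n+i)}, W\rangle = \varepsilon,\quad i = 1, \dots, n, \tag{4.20}$$

where (4.11) and (4.12) represent the zero blocks in the matrix of (4.10), conditions 
(4.13) and (4.14) describe the block 'I' (the identity matrix), conditions (4.15)-(4.18) 
represent the 'X' blocks, and (4.19) and (4.20) describe the relation between the blocks 
'Z' and 'Z + εI' therein. In terms of $W \in S^{5n\times 5n}$, the equality $\langle A_i, X\rangle = 0$ in (4.7) 
can be written as $\langle P_i, W\rangle = 0$, the inequality $1 \le \operatorname{tr}(X) \le \sqrt{n}$ can be represented 
as $1 \le \langle P_0, W\rangle \le \sqrt{n}$, and the objective of (4.7) can be written as $\langle P, W\rangle$, where 
$P_i, P_0, P \in S^{5n\times 5n}$ are given by 

$$P_i = \operatorname{diag}(A_i, 0, 0, 0, 0),\quad P_0 = \operatorname{diag}(I, 0, 0, 0, 0),\quad P = \operatorname{diag}\left(0, 0, \tfrac{1}{\eta} I, I, 0\right),$$

where $\operatorname{diag}(\cdot)$ denotes the block-diagonal matrix with the listed $n \times n$ diagonal blocks and $I$ is the $n \times n$ identity matrix. Thus, (4.7) can be written as the following SDP 
problem: 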

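$$\begin{array}{lll}
\min & \langle P, W\rangle & \\
\text{s.t.} & \langle E^{(i,\,n+j)}, W\rangle = 0, & i = 1, \dots, n,\ j = 1, \dots, 4n, \\
& \langle E^{(n+i,\,3n+j)}, W\rangle = 0, & i, j = 1, \dots, 2n, \\
& \langle E^{(n+i,\,n+j)}, W\rangle = 0, & i = 1, \dots, n-1,\ j = i+1, \dots, n, \\
& \langle E^{(n+i,\,n+i)}, W\rangle = 1, & i = 1, \dots, n, \\
& \langle E^{(i,j)} - E^{(n+i,\,2n+j)}, W\rangle = 0, & i = 1, \dots, n-1,\ j = i+1, \dots, n, \\
& \langle E^{(i,j)} - E^{(n+j,\,2n+i)}, W\rangle = 0, & i = 1, \dots, n-1,\ j = i+1, \dots, n, \\
& \langle 2E^{(i,i)} - E^{(n+i,\,2n+i)}, W\rangle = 0, & i = 1, \dots, n, \\
& \langle E^{(n+i,\,2n+j)} - E^{(3n+i,\,4n+j)}, W\rangle = 0, & i, j = 1, \dots, n, \\
& \langle E^{(4n+i,\,4n+j)} - E^{(2n+i,\,2n+j)}, W\rangle = 0, & i = 1, \dots, n-1,\ j = i+1, \dots, n, \\
& \langle E^{(4n+i,\,4n+i)} - E^{(2n+i,\,2n+i)}, W\rangle = \varepsilon, & i = 1, \dots, n, \\
& \langle P_i, W\rangle = 0, & i = 1, \dots, m, \\
& 1 \le \langle P_0, W\rangle \le \sqrt{n}, & W \succeq 0,
\end{array} \tag{4.21}$$

which is of the form (4.8). By Lemma 4.2, its dual problem is given by 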

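$$\begin{array}{ll}
\max & \displaystyle\sum_{i=1}^{n} \alpha_i + \sum_{i=1}^{n} \varepsilon\beta_i + t_1 + \sqrt{n}\, t_2 \\
\text{s.t.} & \displaystyle\sum_{i=1}^{n}\sum_{j=1}^{4n} u_{ij} E^{(i,\,n+j)} + \sum_{i,j=1}^{2n} w_{ij} E^{(n+i,\,3n+j)} + \sum_{i=1}^{n-1}\sum_{j=i+1}^{n} \phi_{ij} E^{(n+i,\,n+j)} \\
& \displaystyle + \sum_{i=1}^{n} \alpha_i E^{(n+i,\,n+i)} + \sum_{i=1}^{n-1}\sum_{j=i+1}^{n} \left[ v_{ij}\left(E^{(i,j)} - E^{(n+i,\,2n+j)}\right) + v_{ji}\left(E^{(i,j)} - E^{(n+j,\,2n+i)}\right) \right] \\
& \displaystyle + \sum_{i=1}^{n} v_{ii}\left(2E^{(i,i)} - E^{(n+i,\,2n+i)}\right) + \sum_{i,j=1}^{n} \theta_{ij}\left(E^{(n+i,\,2n+j)} - E^{(3n+i,\,4n+j)}\right) \\
& \displaystyle + \sum_{i=1}^{n-1}\sum_{j=i+1}^{n} \rho_{ij}\left(E^{(4n+i,\,4n+j)} - E^{(2n+i,\,2n+j)}\right) + \sum_{i=1}^{n} \beta_i\left(E^{(4n+i,\,4n+i)} - E^{(2n+i,\,2n+i)}\right) \\
& \displaystyle + \sum_{i=1}^{m} \mu_i P_i + t_1 P_0 + t_2 P_0 \preceq P, \\
& t_1 \ge 0,\ t_2 \le 0.
\end{array}$$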

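By the structure of $P$, $P_0$, the $E^{(k,l)}$'s and the $P_i$'s, the above problem can be written as 

$$\begin{array}{ll}
\max & \displaystyle\sum_{i=1}^{n} \alpha_i + \sum_{i=1}^{n} \varepsilon\beta_i + t_1 + \sqrt{n}\, t_2 \quad \left(= \operatorname{tr}(\Phi) - \varepsilon\operatorname{tr}(Q) + t_1 + \sqrt{n}\, t_2\right) \\
\text{s.t.} & \begin{pmatrix}
\sum_{i=1}^{m} \mu_i A_i + (t_1 + t_2) I + (V + V^T) & U_1 & U_2 & U_3 & U_4 \\
U_1^T & \Phi & \Theta - V & U_5 & U_6 \\
U_2^T & \Theta^T - V^T & Q - \frac{1}{\eta} I & U_7 & U_8 \\
U_3^T & U_5^T & U_7^T & -I & -\Theta \\
U_4^T & U_6^T & U_8^T & -\Theta^T & -Q
\end{pmatrix} \preceq 0, \\
& t_1 \ge 0,\ t_2 \le 0,
\end{array} \tag{4.22}$$

where the $\alpha_i$'s and $-\beta_i$'s are the diagonal entries of $\Phi$ and $Q$, respectively, i.e., $\operatorname{diag}(\Phi) = 
(\alpha_1, \dots, \alpha_n)$ and $\operatorname{diag}(Q) = (-\beta_1, \dots, -\beta_n)$. Thus the objective of the above problem 
can be written as $\operatorname{tr}(\Phi) - \varepsilon\operatorname{tr}(Q) + t_1 + \sqrt{n}\, t_2$. All blocks in the above matrix are 
$n \times n$ submatrices. Also, note that (4.22) is always feasible and satisfies Slater's 
condition; for instance, ($\Theta = V = 0$, $\Phi = -I$, $Q = \frac{1}{2\eta} I$, $t_1 = 1$, $t_2 = -2$, $\mu_i = 0$ for all 
$i = 1, \dots, m$, and $U_i = 0$ for all $i = 1, \dots, 8$) is a strictly feasible point. 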

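Theorem 4.3. If there exist $(\eta, \varepsilon) > 0$ and $t_1, t_2, \mu_i$, $i = 1, \dots, m$, and matrices 
$\Phi, Q \in S^{n\times n}$, $V, \Theta \in R^{n\times n}$ and $M_i \in R^{n\times n}$, $i = 1, \dots, 8$, such that the following 
conditions hold: 

$$\left\lceil \operatorname{tr}(\Phi) - \varepsilon\operatorname{tr}(Q) + t_1 + \sqrt{n}\, t_2 - \frac{1}{\eta} \right\rceil \ge 2,\quad t_1 \ge 0,\ t_2 \le 0, \tag{4.23}$$

$$\begin{pmatrix}
\sum_{i=1}^{m} \mu_i A_i - (t_1 + t_2) I - (V + V^T) & M_1 & M_2 & M_3 & M_4 \\
M_1^T & -\Phi & V - \Theta & M_5 & M_6 \\
M_2^T & V^T - \Theta^T & \frac{1}{\eta} I - Q & M_7 & M_8 \\
M_3^T & M_5^T & M_7^T & I & \Theta \\
M_4^T & M_6^T & M_8^T & \Theta^T & Q
\end{pmatrix} \succeq 0, \tag{4.24}$$

then 0 is the only solution to the quadratic equation (1.5). 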

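Proof. Let $X^*$ be the minimum rank solution of (4.4) with the least norm 
$\|X^*\|_F = 1$. Let $(Y_{\eta,\varepsilon}, X_{\eta,\varepsilon}, Z_{\eta,\varepsilon})$ be the optimal solution to (4.5). By Theorem 3.1, 
we have $r^* \ge \lceil \operatorname{tr}(Y_{\eta,\varepsilon}) \rceil$ for every $(\eta, \varepsilon) > 0$, where $r^*$ is the minimum rank of (4.4). Since 
(4.7) is a relaxation of (4.5), the optimal value of (4.7), denoted by $v^*(\eta, \varepsilon)$, provides 
a lower bound for that of (4.5), i.e., 

$$\operatorname{tr}(Y_{\eta,\varepsilon}) + (1/\eta)\operatorname{tr}(Z_{\eta,\varepsilon}) \ge v^*(\eta, \varepsilon), \tag{4.25}$$

which holds for any given $(\eta, \varepsilon) > 0$. Note that (4.22) is the dual problem of (4.7). 
If the conditions (4.23) and (4.24) hold, then for this $(\eta, \varepsilon)$, the point consisting of 
$(t_1, t_2, \Phi, Q, V, \Theta)$, the multipliers $-\mu_i$, $i = 1, \dots, m$ (in place of $\mu_i$), and $U_j = -M_j$, $j = 1, \dots, 8$, is feasible to the dual problem (4.22). 
Thus, by duality theory we have 

$$v^*(\eta, \varepsilon) \ge \operatorname{tr}(\Phi) - \varepsilon\operatorname{tr}(Q) + t_1 + \sqrt{n}\, t_2. \tag{4.26}$$

Notice that $(Y^*, Z^*, X^*)$, where $Y^* = X^*((X^*)^T X^* + \varepsilon I)^{-1}(X^*)^T$ and $Z^* = (X^*)^T X^*$, 
is a feasible point of (4.5). Thus 

$$\operatorname{tr}(Y_{\eta,\varepsilon}) + (1/\eta)\operatorname{tr}(Z_{\eta,\varepsilon}) \le \operatorname{tr}(Y^*) + (1/\eta)\operatorname{tr}(Z^*) = \phi_\varepsilon(X^*) + (1/\eta), \tag{4.27}$$

where the last equality follows from $\operatorname{tr}(Y^*) = \phi_\varepsilon(X^*)$ and $\operatorname{tr}(Z^*) = \|X^*\|_F^2 = 1$. 
Combining (4.25), (4.26) and (4.27) yields 

$$\phi_\varepsilon(X^*) + (1/\eta) \ge \operatorname{tr}(\Phi) - \varepsilon\operatorname{tr}(Q) + t_1 + \sqrt{n}\, t_2.$$

This together with (3.13) implies that $\operatorname{rank}(X^*) \ge \operatorname{tr}(\Phi) - \varepsilon\operatorname{tr}(Q) + t_1 + \sqrt{n}\, t_2 - (1/\eta)$. 
Thus, under the conditions (4.23) and (4.24), and since the rank is an integer, we see that 

$$r^* = \operatorname{rank}(X^*) \ge \left\lceil \operatorname{tr}(\Phi) - \varepsilon\operatorname{tr}(Q) + t_1 + \sqrt{n}\, t_2 - (1/\eta) \right\rceil \ge 2.$$

By Lemma 4.1, we conclude that (4.1) holds, i.e., 0 is the only solution to (1.5). □ 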
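
The quantity $\operatorname{tr}(Y^*)$ used in (4.27) can be evaluated directly. The sketch below (NumPy; the test matrix is a hypothetical rank-one example with unit F-norm, and `phi_eps` is a name introduced here) computes $\operatorname{tr}\big(X(X^TX + \varepsilon I)^{-1}X^T\big)$, which the proof identifies with $\phi_\varepsilon(X^*)$. In terms of the singular values $\sigma_i$ of $X$ this trace equals $\sum_i \sigma_i^2/(\sigma_i^2 + \varepsilon)$, so it never exceeds the rank and approaches it as $\varepsilon \to 0$.

```python
import numpy as np

def phi_eps(X, eps):
    """tr(X (X^T X + eps I)^{-1} X^T), i.e. tr(Y*) for the feasible point in the proof."""
    n = X.shape[1]
    inner_solve = np.linalg.solve(X.T @ X + eps * np.eye(n), X.T)
    return float(np.trace(X @ inner_solve))

# Hypothetical rank-one positive semidefinite matrix with ||X||_F = 1.
x = np.array([3.0, 4.0]) / 5.0
X = np.outer(x, x)

eps_values = (1e-1, 1e-3, 1e-6)
values = [phi_eps(X, eps) for eps in eps_values]   # here each value is 1 / (1 + eps)
rank_X = np.linalg.matrix_rank(X)

print(values, rank_X)
```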

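From the above result, a number of sufficient conditions stronger than (4.23)-(4.24) 
can be obtained. For example, we have the following corollary. 

Corollary 4.4. Let $A_i \in S^n$, $i = 1, \dots, m$, be a given set of matrices. If there exist 
$(\eta, \varepsilon) > 0$, $t_1, t_2, \mu_1, \dots, \mu_m \in R$, $Q, \Phi \in S^{n\times n}$ and $V, \Theta \in R^{n\times n}$ such that 

$$\left\lceil \operatorname{tr}(\Phi) - \varepsilon\operatorname{tr}(Q) + t_1 + \sqrt{n}\, t_2 - 1/\eta \right\rceil \ge 2,\quad t_1 \ge 0,\ t_2 \le 0, \tag{4.28}$$

$$\begin{pmatrix} -\Phi & V - \Theta \\ V^T - \Theta^T & \frac{1}{\eta} I - Q \end{pmatrix} \succeq 0,\quad \begin{pmatrix} I & \Theta \\ \Theta^T & Q \end{pmatrix} \succeq 0, \tag{4.29}$$

$$\sum_{i=1}^{m} \mu_i A_i - (t_1 + t_2) I - (V + V^T) \succeq 0, \tag{4.30}$$

then 0 is the only solution to the system (1.5). 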

We now point out that (1.6) implies (4.28)-(4.30). Let $\eta > 0$ be a given number. 
If $\sum_{i=1}^{m} t_i A_i \succ 0$ for some $t_i$, $i = 1, \dots, m$, then we choose $\mu_i = \alpha t_i$, where $\alpha$ can be any 
large positive number such that $\sum_{i=1}^{m} \mu_i A_i \succeq t_1 I$, where $t_1 = 2 + \frac{1}{\eta}$. Then conditions 
(4.28)-(4.30) hold with $V = \Phi = Q = \Theta = 0$ and $t_2 = 0$. Thus, the known condition 
(1.6) indeed implies (4.28)-(4.30). For $m = 2$ and $n \ge 3$, since the condition (4.1) is 
equivalent to $\mu_1 A_1 + \mu_2 A_2 \succ 0$ for some $\mu_1, \mu_2$, the sufficient conditions in Theorem 4.3 and Corollary 
4.4 are also necessary conditions for (4.1). When $n = 2$ or $m \ge 3$, if an example can 
be given to show that our sufficient conditions do not imply (1.6), then the open 
'Problem 13' in [21] would be addressed. We conjecture that our sufficient conditions 
are indeed mild ones for (4.1). 
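
The construction showing that (1.6) implies (4.28)-(4.30) can be traced on a small example. In the sketch below (NumPy; the matrices `A1`, `A2`, the weights `t` and the value of `eta` are hypothetical), a positive definite combination $\sum_i t_i A_i$ is scaled by $\alpha$ so that $\sum_i \mu_i A_i \succeq t_1 I$ with $t_1 = 2 + 1/\eta$, and the conditions are then evaluated with $V = \Phi = Q = \Theta = 0$ and $t_2 = 0$.

```python
import math
import numpy as np

# Hypothetical data: x^T A1 x = x1^2 - x2^2 and x^T A2 x = 2 x2^2 vanish only at x = 0.
A1 = np.array([[1.0, 0.0], [0.0, -1.0]])
A2 = np.array([[0.0, 0.0], [0.0, 2.0]])
t = np.array([1.0, 1.0])                  # t1*A1 + t2*A2 = I is positive definite
eta = 0.5
n = 2

combo = t[0] * A1 + t[1] * A2
lam_min = np.linalg.eigvalsh(combo).min() # positive, since combo is positive definite

t1 = 2.0 + 1.0 / eta                      # choice of t_1 made in the text
t2 = 0.0
alpha = t1 / lam_min                      # smallest scaling with alpha * combo >= t1 * I
mu = alpha * t

# Condition (4.30) with V = 0: sum_i mu_i A_i - (t1 + t2) I must be PSD.
lhs_430 = mu[0] * A1 + mu[1] * A2 - (t1 + t2) * np.eye(n)
min_eig_430 = np.linalg.eigvalsh(lhs_430).min()

# Condition (4.28) with Phi = Q = 0: ceil(t1 + sqrt(n) * t2 - 1/eta) >= 2.
value_428 = t1 + math.sqrt(n) * t2 - 1.0 / eta

print(alpha, min_eig_430, value_428)
```

Condition (4.29) holds trivially for this choice, since both matrices reduce to block-diagonal matrices with blocks $0$, $\frac{1}{\eta}I$ and $I$, $0$.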

Remark 4.5. To get simpler sufficient conditions for (4.1), we may continue 
to reduce the freedom of the variables in (4.28)-(4.30). For instance, (4.29) can be 
replaced by a stronger version like $\Phi \preceq 0$, $\frac{1}{\eta} I \succeq Q$, $Q \succeq V^T V$ without involving 
the matrix $\Theta$. It is also worth stressing that checking the new sufficient conditions 
developed in this section can be achieved by solving an SDP problem. For instance, if 
the optimal value of the SDP problem (4.22) is greater than $\frac{1}{\eta} + 1$, then the conditions 
(4.23)-(4.24) hold. Similarly, if the optimal value of the SDP problem (4.22) with 
$U_i = 0$, $i = 1, \dots, 8$, is greater than $\frac{1}{\eta} + 1$, then the conditions (4.28)-(4.30) hold. 

5. Conclusions. Since $\operatorname{rank}(X)$ is a discontinuous, integer-valued function, 
the rank minimization problem is hard to solve directly. In this 
paper, we have presented a generic approximation approach for rank minimization 
problems through the approximation function $\phi_\varepsilon(X)$. In particular, we have shown 
that when the feasible set is bounded the rank minimization problem can be approximated 
to any level of accuracy by a nonlinear SDP problem or a linear bilevel SDP 
problem with a special structure. To obtain a tractable approximation of the rank 
minimization with linear constraints, the approximation model (3.5) is introduced, 
and is proved to be efficient for locating the minimum rank solution of the problem if 
the feasible set contains a minimum rank element with the least F-norm. In this case, 
the rank minimization problem is equivalent to an SDP problem. This theory was 
applied to a system of quadratic equations which can be formulated as a rank minimization 
problem. Based on its approximation counterpart, we have developed some sufficient 
conditions for such a system to have only the zero solution. 